RANDOM MATRICES: TAIL BOUNDS FOR GAPS BETWEEN EIGENVALUES


HOI NGUYEN, TERENCE TAO, AND VAN VU 


Abstract. Gaps (or spacings) between consecutive eigenvalues are a central topic in random matrix theory. 
The goal of this paper is to study the tail distribution of these gaps in various random matrix models. We 
give the first repulsion bound for random matrices with discrete entries and the first super-polynomial bound 
on the probability that a random graph has simple spectrum, along with several applications. 


1. Introduction 

Let $M_n$ be a random Hermitian matrix of size $n$, with eigenvalues $\lambda_1 \le \dots \le \lambda_n$. In this paper, we study the
tail distribution of the gap (spacing) $\delta_i := \lambda_{i+1} - \lambda_i$, and more generally the tail distribution of $\lambda_{i+l} - \lambda_i$,
where $l$ is fixed and $1 \le i \le n-l$. We are going to consider the following classes of random matrix ensembles.

Wigner matrices. A Wigner matrix $X_n$ is a random Hermitian matrix whose strictly upper triangular
entries are iid sub-gaussian random variables with mean 0 and variance 1 and whose diagonal entries are
independent sub-gaussian random variables with mean 0 and variances bounded by $n^{1-o(1)}$, with the diagonal
entries independent of the strictly upper triangular entries. Here and in the sequel we say that $\xi$ is a
sub-gaussian random variable with sub-gaussian moment $m_0 > 0$ if one has $\mathbf{P}(|\xi| \ge t) \le \exp(-m_0 t^2)$ for all
$t > 0$.

Adjacency matrix of random graphs. Let $G(n,p)$ be the Erdős-Rényi graph on $n$ vertices with edge density
$p$. We denote by $A_n(p)$ the (zero-one) adjacency matrix of $G(n,p)$.

Random matrix with arbitrary mean. We consider a random Hermitian matrix of the form $M_n := F_n + X_n$,
where $F = F_n$ is a deterministic symmetric matrix of size $n$ and of norm $\|F_n\|_2 = n^{O(1)}$, and $X_n$ is a random
Hermitian matrix where the off-diagonal entries $\xi_{ij}$, $1 \le i < j \le n$, are iid copies of a random real variable
$\xi$ of zero mean, unit variance, and finite $(2+\varepsilon_0)$-th moment with fixed $\varepsilon_0 > 0$. The diagonal entries are
independent random variables with mean zero and variance bounded by $O(1)$.


Here and later all asymptotic notations are used under the assumption that $n \to \infty$.


Gaps between consecutive eigenvalues have a central place in the theory of random matrices. The limiting
(global) gap distribution for gaussian matrices (GUE and GOE) has been known for some time [17]. Recent
progress on the universality conjecture showed that these limiting distributions are universal within the class
of Wigner matrices; see HU El I32j for surveys. However, at the microscopic level, there are many open
problems concerning basic questions. In the following discussion, when we say limiting distribution, we
always mean after a proper normalization.

The first natural question is the limiting distribution of a given gap $\delta_i := \lambda_{i+1} - \lambda_i$. For GUE, this
was computed very recently by the second author [26]. Within the class of Wigner matrices, the four
moment theorem from [34, Theorem 15] asserts that this distribution is universal, provided the four matching
moment condition holds. The matching moment condition was recently removed by Erdős and Yau m using
sophisticated techniques from the theory of parabolic PDE to analyze a Dyson Brownian motion, allowing
for a computation of the gap distribution for random matrix ensembles such as the GOE or Bernoulli
ensembles.

Another issue is to understand the size of the minimum gap $\delta_{\min} := \min_{1 \le i \le n-1}(\lambda_{i+1} - \lambda_i)$. For the GUE
ensemble, Bourgade and Ben-Arous [3] showed that the minimum gap $\delta_{\min}$ is of order $n^{-5/6}$ and computed
the limiting distribution. To the best of our knowledge, no polynomial lower bound (of any fixed
exponent) is known for $\delta_{\min}$ for discrete random matrices, which are of importance in applications in random graph
theory and theoretical computer science. Even proving that $\delta_{\min} > 0$ (in other words, that the random matrix
has simple spectrum) with high probability in the discrete case is already a highly non-trivial problem, first
raised by Babai in the 1980s (motivated by his study of the isomorphism problem |^). This latter problem
was solved only very recently by the last two authors [35].

Our main goal is to provide lower tail bounds for the gaps $\delta_i$, and hence on the minimum gap $\delta_{\min}$. For
the model $X_n$, by Wigner's semi-circle law most eigenvalues are in the interval $[-2\sqrt{n}, 2\sqrt{n}]$, thus the
average gap is of order $n^{-1/2}$. The question is to estimate the probability that a particular gap is significantly
smaller than the average:

Question 1.1. Estimate $\mathbf{P}(\delta_i \le \delta n^{-1/2})$, where $\delta$ may tend to zero with $n$.


As is well known, tail bounds (or deviation bounds) are essential tools in probability, and we believe that good bounds
for the probability in question will have a wide range of applications.

Let us first discuss a few results related to this question. The last two authors showed [M] that for every
constant $c_0 > 0$ there exists $c_1 > 0$ such that for Wigner matrices and for fixed $\varepsilon > 0$ one has

$$\sup_{\varepsilon n \le i \le (1-\varepsilon)n} \mathbf{P}(\delta_i \le n^{-c_0 - \frac{1}{2}}) \ll n^{-c_1}.$$

The restriction to the bulk region $\varepsilon n \le i \le (1-\varepsilon)n$ was removed in [30], and the mean zero requirement was
partially removed in [20]. The weakness of this theorem is that $c_1$ is small (much smaller than 1, regardless
of the value of $c_0$), and thus one cannot use the union bound to conclude that $\delta_i > 0$ simultaneously for all $i$.

In [12], Erdős et al. proved for real Wigner matrices

$$\frac{1}{n} \sum_{\varepsilon n \le i \le (1-\varepsilon)n} \mathbf{P}(\delta_i \le \delta n^{-1/2}) \ll \delta^2 \qquad (1)$$

for any constant $\varepsilon, \delta > 0$, with a similar result also available at the edge of the spectrum. The quadratic
decay $\delta^2$ here comes from an eigenvalue repulsion phenomenon, reflecting the first-order decay of the two-point
correlation function $\rho_2(x,y)$ of the GOE ensemble as one approaches the diagonal $x = y$. However,
this result only gives a bound on the average probability, and furthermore $\delta$ needs to be a constant.

Under some strong smoothness and decay hypotheses on the entries of a Hermitian Wigner matrix $X_n$, it
was shown by Erdős, Schlein, and Yau [13] that one has the Wegner estimate

$$\mathbf{P}\Big(E n^{1/2} - \frac{\varepsilon}{n^{1/2}} \le \lambda_i \le \lambda_{i+k-1} \le E n^{1/2} + \frac{\varepsilon}{n^{1/2}} \text{ for some } i\Big) \ll \varepsilon^{k^2} \qquad (2)$$

for any fixed $k \ge 1$ and any $\varepsilon > 0$ and any bounded $E \in \mathbf{R}$ (note that the normalization in [13] is different
from the one used here). Setting $k = 2$ and applying the union bound (together with a standard bound on
the operator norm of $X_n$), we conclude that

$$\mathbf{P}(\delta_{\min} \le \delta n^{-1/2}) \ll n\delta^3 + \exp(-cn)$$

for some absolute constant $c > 0$. This is a strong (and essentially optimal) bound for small $\delta$, but it is
only established for very smooth and rapidly decreasing complex probability distributions on the entries; in
particular it does not apply at all to discrete ensembles such as the Bernoulli ensemble. An analogue of (2)
for real smooth ensembles, with the exponent $k^2$ replaced by $k(k+1)/2$, was established in [H Appendix B].

Finally, in [35], the last two authors established the bound

$$\mathbf{P}(\delta_i = 0) \le n^{-A}$$

for all $1 \le i < n$ and any fixed $A > 0$, for any of the three random matrix models (Wigner, Erdős-Rényi,
random matrix with arbitrary mean) considered above. By the union bound (and shifting $A$ by 1), this also
gives

$$\mathbf{P}(\delta_{\min} = 0) \le n^{-A}.$$

In this paper, we are going to give answers to Question 1.1 and also to the more general question of
bounding $\mathbf{P}(\lambda_{i+l} - \lambda_i \le \delta n^{-1/2})$ for a fixed $l$. As with [35], our method is based on probabilistic inverse
Littlewood-Offord theorems (avoiding the use of moment comparison theorems or analysis of Dyson Brownian
motion), and works for any of the three random matrix ensembles introduced above, without requiring any
smoothness on the distribution of the entries. (However, the two special models of Wigner matrices and
adjacency matrices allow a more delicate analysis leading to better bounds.)


2. Main results 

For the sake of applications, we will be mainly focusing on real symmetric matrices. All results can be 
extended to the complex case. 

We begin with the Wigner model, where our results are strongest. Our first main theorem is as follows. 

Theorem 2.1 (Lower tail bound for a single gap). There is a constant $0 < c < 1$ (depending on the
sub-gaussian moments) such that the following holds for the gaps $\delta_i := \lambda_{i+1}(X_n) - \lambda_i(X_n)$ of real symmetric
Wigner matrices $X_n$. For any quantities $n^{-c} \le \alpha \le c$ and $\delta \ge n^{-c/\alpha}$, we have

$$\sup_{1 \le i \le n-1} \mathbf{P}(\delta_i \le \delta n^{-1/2}) = O\Big(\frac{\delta}{\sqrt{\alpha}}\Big).$$

Notice that there is a trade-off between the quality of the bound and the range of $\delta$. At the one end, taking
$\alpha$ to be a small constant, we have

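Corollary 2.2. For any constant $C_0 > 0$ there is a constant $c_0 > 0$ such that for real symmetric Wigner
matrices $X_n$, we have

$$\sup_{1 \le i \le n-1} \mathbf{P}(\delta_i \le \delta n^{-1/2}) \le c_0 \delta,$$

for all $\delta \ge n^{-C_0}$. Consequently, by the union bound,

$$\delta_{\min} \ge n^{-3/2 - o(1)}$$

with probability $1 - o(1)$.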




At the other end, taking $\alpha = n^{-c}$, we have

Corollary 2.3. Let $X_n$ be a real symmetric Wigner matrix where the upper triangular entries are Bernoulli
(taking value $\pm 1$ with probability $1/2$). Then $X_n$ has multiple eigenvalues with probability at most $O(\exp(-n^{c}))$
for some constant $c > 0$.

This corollary improves the polynomial bound in [35] for this specific case, although the result in [35] holds
in a more general context (the main assumption being that the distribution of any individual entry of the
matrix is not concentrated at a single point). We conjecture that the probability that a Bernoulli matrix
has multiple eigenvalues is in fact $O(\exp(-cn))$ for some constant $c > 0$.

In the next theorem, we treat the gap $\lambda_{i+l} - \lambda_i$, where $l \ge 1$ is fixed. Let $d := \lfloor \log_2 l \rfloor$, and set

$$c_l := \frac{(3l + 3 - 2^{d+1})\,2^d - 1}{3}.$$

Thus $c_2 = 3$ and $c_l \ge \frac{l^2 + 2l}{3}$.
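
As a quick sanity check on the definition of $c_l$, the following Python sketch evaluates the exponent for small $l$ and compares it with $(l^2+2l)/3$. The function names `repulsion_exponent` and `lower_bound` are ours and purely illustrative, and the comparison quantity $(l^2+2l)/3$ is our reading of the lower bound stated above.

```python
from fractions import Fraction


def repulsion_exponent(l: int) -> Fraction:
    """c_l = ((3l + 3 - 2^(d+1)) * 2^d - 1) / 3 with d = floor(log2 l)."""
    if l < 1:
        raise ValueError("l must be a positive integer")
    d = l.bit_length() - 1  # floor(log2 l) for l >= 1
    return Fraction((3 * l + 3 - 2 ** (d + 1)) * 2 ** d - 1, 3)


def lower_bound(l: int) -> Fraction:
    """Comparison quantity (l^2 + 2l) / 3 (assumed form of the lower bound)."""
    return Fraction(l * l + 2 * l, 3)


# Tabulate l, c_l and the comparison quantity for small l.
for l in range(1, 9):
    print(l, repulsion_exponent(l), lower_bound(l))
```

For $l = 1$ the exponent is 1, and for $l = 2$ it is 3. In this range the two quantities agree exactly when $l = 2^{d+1} - 1$ (that is, $l = 1, 3, 7$).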

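Theorem 2.4 (Repulsion for multiple gaps). There is a constant $0 < c < 1$ (depending on the sub-gaussian
moments and on $l$) such that the following holds for real symmetric Wigner matrices $X_n$. For any quantities
$n^{-c} \le \alpha \le c$ and $\delta \ge n^{1-c/\alpha}$,

$$\sup_{1 \le i \le n-l} \mathbf{P}(|\lambda_{i+l}(X_n) - \lambda_i(X_n)| \le \delta n^{-1/2}) = O\Big(\Big(\frac{\delta}{\sqrt{\alpha}}\Big)^{c_l}\Big).$$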
Similar to Corollary 2.2, we have

Corollary 2.5. For any constants $l, C_0 > 0$ there is a constant $c_0 > 0$ such that for real Wigner matrices $X_n$

$$\sup_{1 \le i \le n-l} \mathbf{P}(|\lambda_{i+l}(X_n) - \lambda_i(X_n)| \le \delta n^{-1/2}) \le c_0 \delta^{c_l},$$

for all $\delta \ge n^{-C_0}$.

The key feature of this result is that the bound $\delta^{c_l}$ yields evidence for the repulsion between nearby
eigenvalues. For $l = 2$, we have $c_l = 3$, yielding the bound $O(\delta^3)$. If there were no repulsion, and the eigenvalues
behaved like points in a Poisson process, then the bound would be $O(\delta^2)$ instead. The bound $c_l \ge \frac{l^2+2l}{3}$ seems
to be sharp, up to a constant factor; compare with (2).

We next consider the general model $M_n := F_n + X_n$. It is often useful to view $M_n$ as a random perturbation
of the deterministic matrix $F_n$, especially with respect to applications in data science and numerical analysis,
where matrices (as data or inputs to algorithms) are often perturbed by random noise. One can consult
for instance [33], where this viewpoint is discussed with respect to the least singular value problem. As an
illustration of this viewpoint, we are going to present an application in numerical analysis.

Theorem 2.6. Assume $\|F_n\|_2 \le n^{\gamma}$ for some constant $\gamma > 0$. Then for any fixed $A > 0$, there exists $B > 0$
depending on $\gamma, A$ such that

$$\sup_{1 \le i \le n-1} \mathbf{P}(\delta_i \le n^{-B}) = O(n^{-A}),$$

where $\delta_i := \lambda_{i+1}(M_n) - \lambda_i(M_n)$ is the gap of a random matrix $M_n = F_n + X_n$ with arbitrary mean. In
particular, the matrix $M_n$ has simple spectrum with probability $1 - O(n^{-A})$ for any fixed $A > 0$.


This theorem shows that eigenvalue separation holds regardless of the means of the entries. The dependence of
$B$ on $A$ and $\gamma$ in Theorem 2.6 can be made explicit; for instance one can (rather generously) assume
$B > (5A + 6)\max\{1/2, \gamma\} + 5$.


Finally, let us focus on the adjacency matrix of the random graph $G(n,p)$, where $0 < p < 1$ is a constant.
Set $F_n := p(J_n - I_n)$, where $J_n$ is the all-one matrix and $I_n$ is the identity. In this case, we can sharpen the
bound of Theorem 2.6 to almost match the better bound in Theorem 2.1.

Theorem 2.7. Let $0 < p < 1$ be independent of $n$, and let $A_n$ be the adjacency matrix of the random
graph $G(n,p)$. Let $\delta_i := \lambda_{i+1}(A_n) - \lambda_i(A_n)$ denote the eigenvalue gaps. Then for any fixed $A > 0$, and any
$\delta > n^{-A}$, we have

$$\sup_{1 \le i \le n-1} \mathbf{P}(\delta_i \le \delta n^{-1/2}) = O(n^{o(1)}\delta).$$


All of our results extend to the Hermitian case. In this case the upper triangular entries are complex
variables whose real and imaginary components are iid copies of a sub-gaussian random variable of mean zero
and variance 1/2. The value of $c_l$ in Theorem 2.4 doubles; see Remark 6.3 for a discussion.


Our approach also works, with minor modifications, for random matrices where the variance of the entries
decays to zero with $n$. In particular, we can have them as small as $n^{-1+c}$ for any fixed $c > 0$. This case
contains in particular the adjacency matrix of sparse random graphs. Details will appear elsewhere.


3. Applications 

3.1. Random graphs have simple spectrum. Babai conjectured that $G(n, 1/2)$ has a simple spectrum,
with probability $1 - o(1)$. This conjecture was recently settled in [35]. Using the new deviation bounds, we
can obtain the following stronger statement.

Theorem 3.1. With probability $1 - o(1)$, the gap between any two eigenvalues of $G(n, 1/2)$ is at least
$n^{-3/2+o(1)}$.

To prove this theorem, apply Theorem 2.7 with $\delta = n^{-1+o(1)}$ and then apply the union bound.


3.2. Nodal domains of random graphs. Consider a random graph $G(n,p)$ (with $p$ constant) and the
accompanying matrix $A_n(p)$. Let $u$ be an eigenvector of $A_n(p)$. A strong nodal domain (with respect to $u$) is
a maximal connected component $S$ of the graph, where for any $i, j \in S$, $u_i u_j > 0$ (here $u_i$ is the coordinate
of $u$ corresponding to $i$). A weak nodal domain is defined similarly, but with $u_i u_j \ge 0$. Notice that strong
nodal domains are disjoint, while the weak ones may overlap.

The notion of (weak and strong) nodal domains comes from Riemannian geometry and has become increasingly
useful in graph theory and algorithmic applications (see [S [3211 [318]). Dekel, Lee and Linial studied
nodal domains of the random graph $G(n,p)$ and raised the following conjecture [3 Question 1].

Conjecture 3.2. With probability $1 - o(1)$, all eigenvectors of $G(n,p)$ do not have zero coordinates. In
other words, weak and strong nodal domains are the same.


We can now confirm this conjecture in the following stronger form.

Theorem 3.3 (Non-degeneration of eigenvectors). Assume that $M_n = X_n + F_n$ as in Theorem 2.6. Then
for any $A$, there exists $B$ depending on $A$ and $\gamma$ such that

$$\mathbf{P}\big(\exists \text{ an eigenvector } v = (v_1, \dots, v_n) \text{ of } M_n \text{ with } |v_i| \le n^{-B} \text{ for some } i\big) = O(n^{-A}).$$


To make the picture complete, let us mention that recently Arora et al. [5] proved that with high probability,
$G(n,p)$ has only two weak nodal domains, one corresponding to coordinates $u_i \ge 0$, and the other to $u_i \le 0$.
Combining this with Theorem 3.3, we have

Corollary 3.4. The following holds with probability $1 - o(1)$ for $G(n,p)$. Each eigenvector has exactly two
strong nodal domains, which partition the set of vertices.


3.3. Numerical Analysis. Our results can also be used to guarantee polynomial running time for certain
algorithms. Let us consider an example. A basic problem in numerical analysis is to compute the leading
eigenvector and eigenvalue of a large matrix. A well-known power iteration method, designed for this purpose,
works as follows. Given $F$, a large symmetric matrix as input, let $u_0 := u$ be an arbitrary unit vector and
consider

$$u_{k+1} := \frac{F u_k}{\|F u_k\|_2}.$$

For simplicity, assume that $F$ is positive semi-definite and let $0 \le \lambda_1 \le \dots \le \lambda_n$ be the eigenvalues of $F$
(with, say, corresponding eigenvectors $v_1, \dots, v_n$). If $\lambda_n$ is strictly larger than $\lambda_{n-1}$, then $u_k$ converges to
$v_n$ and $\|F u_k\|_2$ converges to $\lambda_n$.
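
The power iteration just described, $u_{k+1} = F u_k / \|F u_k\|_2$, can be sketched in a few lines of plain Python. This is only a minimal illustration on a $2 \times 2$ toy matrix; the helper names (`mat_vec`, `norm2`, `power_iteration`) and the dense list-of-rows representation are our own choices, not part of any particular library.

```python
import math


def mat_vec(F, u):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(f * c for f, c in zip(row, u)) for row in F]


def norm2(u):
    """Euclidean norm of a vector."""
    return math.sqrt(sum(c * c for c in u))


def power_iteration(F, u0, steps):
    """Run u_{k+1} = F u_k / ||F u_k||_2 for `steps` rounds.

    Returns the final unit vector u_k and the eigenvalue estimate ||F u_k||_2.
    """
    scale = norm2(u0)
    u = [c / scale for c in u0]
    for _ in range(steps):
        Fu = mat_vec(F, u)
        scale = norm2(Fu)
        u = [c / scale for c in Fu]
    return u, norm2(mat_vec(F, u))


# Toy input: eigenvalues 1 and 3, leading eigenvector (1, 1) / sqrt(2).
F = [[2.0, 1.0], [1.0, 2.0]]
u, lam = power_iteration(F, [1.0, 0.0], 50)
print(u, lam)
```

In this example the starting vector has a nonzero component along the leading eigenvector, which the iteration needs in order to converge to it.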

The heart of the matter is, of course, the rate of convergence, which is geometric with base $\lambda_{n-1}/\lambda_n$. This means
that to obtain an $\varepsilon$ error term, we need to iterate $\Theta\big(\frac{\lambda_n}{\lambda_n - \lambda_{n-1}} \log \frac{1}{\varepsilon}\big)$ steps (see for instance [13 Chapter 8]).

For simplicity, assume that $\lambda_n = \|F\|_2 = n^{O(1)}$. The algorithm is efficient (runs in polynomial time) if
$\lambda_n - \lambda_{n-1}$ is polynomially large. However, if $\lambda_n - \lambda_{n-1}$ is exponentially small in $n$, then the algorithm takes
exponentially many steps.
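
To make the dependence on the gap concrete, the sketch below compares the number of steps $k$ needed for $(\lambda_{n-1}/\lambda_n)^k \le \varepsilon$ with the quantity $\frac{\lambda_n}{\lambda_n - \lambda_{n-1}} \log\frac{1}{\varepsilon}$. It assumes the error is governed purely by this geometric ratio, ignoring constants that depend on the starting vector; the function names are hypothetical.

```python
import math


def steps_needed(lam_top, lam_second, eps):
    """Smallest integer k with (lam_second / lam_top) ** k <= eps, up to float rounding.

    Assumes 0 < lam_second < lam_top and 0 < eps < 1.
    """
    return math.ceil(math.log(1.0 / eps) / math.log(lam_top / lam_second))


def steps_estimate(lam_top, lam_second, eps):
    """The quantity lam_top / (lam_top - lam_second) * log(1 / eps)."""
    return lam_top / (lam_top - lam_second) * math.log(1.0 / eps)


# As the gap shrinks, both counts grow like 1 / gap.
for gap in (1e-1, 1e-3, 1e-6):
    print(gap, steps_needed(1.0, 1.0 - gap, 1e-6), steps_estimate(1.0, 1.0 - gap, 1e-6))
```

Since $\log(\lambda_n/\lambda_{n-1}) \ge (\lambda_n - \lambda_{n-1})/\lambda_n$, the estimate is an upper bound on the exact count, and the two agree to leading order when the gap is small.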
One can use our result to avoid this difficulty. The idea is to artificially perturb $F$ by a random matrix $X_n$.
Theorem 2.6 shows that with high probability, the gap $\lambda_n(F + X_n) - \lambda_{n-1}(F + X_n)$ is polynomially large,
which in turn guarantees a polynomial running time. On the other hand, adding $X_n$ (or a properly scaled
version of it) will not change $\lambda_n$ and $v_n$ significantly. For instance, one can immediately apply Weyl's bound
here, but better results are available in more specific cases.
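
The perturbation idea can be seen already in a $2 \times 2$ toy case, where eigenvalues have a closed form. The sketch below is only an illustration under our own assumptions (random $\pm 1$ entries, a hypothetical scaling parameter `scale`); it is not the polynomial-gap statement of Theorem 2.6. It perturbs the identity matrix, whose top gap is zero, and reports the new top eigenvalue and gap.

```python
import math
import random


def eigenvalues_2x2(a, b, d):
    """Eigenvalues (smaller, larger) of the real symmetric matrix [[a, b], [b, d]]."""
    mean = (a + d) / 2.0
    radius = math.hypot((a - d) / 2.0, b)
    return mean - radius, mean + radius


def perturbed_top_gap(f11, f12, f22, scale, rng):
    """Top eigenvalue and gap of F + scale * X for a random symmetric sign matrix X.

    F = [[f11, f12], [f12, f22]]; the three free entries of X are independent +-1.
    """
    x11, x12, x22 = (rng.choice((-1.0, 1.0)) for _ in range(3))
    low, high = eigenvalues_2x2(f11 + scale * x11, f12 + scale * x12, f22 + scale * x22)
    return high, high - low


rng = random.Random(0)  # fixed seed, for reproducibility only
top, gap = perturbed_top_gap(1.0, 0.0, 1.0, 1e-3, rng)  # F = identity has gap 0
print(top, gap)
```

Whatever signs are drawn, the perturbed gap is at least `2 * scale`, while the top eigenvalue moves by at most `2 * scale`, in line with Weyl's bound since the sign matrix has operator norm at most 2.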


This argument is closely related to the notion of smoothed analysis introduced by Spielman and Teng [24].
We will discuss similar applications in a separate note.


4. Proof strategy 


We first consider Theorem 2.1. In what follows, all eigenvectors have unit length. Assume that the entries
$\xi_{ij}$ are iid copies of a random variable $\xi$. For a fixed vector $x = (x_1, \dots, x_n)$, we introduce the small
ball probability

$$\rho_\delta(x) := \sup_{a \in \mathbf{R}} \mathbf{P}(|\xi_1 x_1 + \dots + \xi_n x_n - a| \le \delta),$$

where $\xi_1, \dots, \xi_n$ are iid copies of $\xi$. Let $X_{n-1}$ be an $(n-1) \times (n-1)$ minor of $X_n$. In this section we will
reduce Theorem 2.1 to the following.

Theorem 4.1. There exist positive constants $c$ and $\alpha_0$ such that the following holds with probability
$1 - O(\exp(-\alpha_0 n))$: for all $n^{-c} < \alpha < c$ and every unit eigenvector $v$ of $X_{n-1}$ and for all $\delta \ge n^{-c/\alpha}$,

$$\rho_\delta(v) = O\Big(\frac{\delta}{\sqrt{\alpha}}\Big).$$

We will discuss how to establish this result later in this section, and give the formal details of the proof in later
sections. Assuming this theorem for the moment, we now finish the proof of Theorem 2.1. To start, let $\mathcal{B}$
be the event that the spectrum of $X_n$ belongs to a controlled interval,

$$\mathcal{B} = \{\omega : \lambda_1(X_n), \dots, \lambda_n(X_n) \in [-10\sqrt{n}, 10\sqrt{n}]\}. \qquad (3)$$

Standard results in random matrix theory (such as [1], [25]) show that $\mathcal{B}$ holds with probability
$1 - \exp(-\Theta(n))$.

Now let $1 \le i \le n-1$, and consider the event $\mathcal{E}_i$ that $X_n$ satisfies $\lambda_{i+1} - \lambda_i \le \delta n^{-1/2}$. For each $1 \le j \le n$, let
$\mathcal{G}_{i,j}$ be the event that $\mathcal{E}_i$ holds and that the eigenvector $w = (w_1, \dots, w_n)^T$ with eigenvalue $\lambda_i(X_n)$ satisfies $|w_j| \ge T$,
where $T$ is to be chosen later. The following delocalization result helps us to choose $T$ properly.

Theorem 4.2. Let $X_n$ be a Wigner matrix as in Theorem 2.1. There are constants $c_1, c_2, c_3 > 0$ such that the
following holds. With probability at least $1 - \exp(-c_1 n)$, every eigenvector of $X_n$ has at least $c_2 n$ coordinates
with absolute value at least $c_3 n^{-1/2}$. The same statement holds for the adjacency matrix $A_n(p)$, where $p$ is
a constant. (The constants $c_1, c_2, c_3$ may depend on the distribution of the entries of $X_n$ and on $p$ in the
$A_n(p)$ case.)

We prove this theorem in Appendix A.


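Write

$$X_n = \begin{pmatrix} X_{n-1} & X \\ X^* & x_{nn} \end{pmatrix} \qquad (4)$$

where $X$ is a column vector. From the Cauchy interlacing law, we observe that $\lambda_i(X_n) \le \lambda_i(X_{n-1}) \le \lambda_{i+1}(X_n)$.
Let $u$ be the (unit) eigenvector of $\lambda_i(X_n)$; we write $u = (w, b)$, where $w$ is a vector of length $n-1$
and $b$ is a scalar. We have

$$\begin{pmatrix} X_{n-1} & X \\ X^* & x_{nn} \end{pmatrix} \begin{pmatrix} w \\ b \end{pmatrix} = \lambda_i(X_n) \begin{pmatrix} w \\ b \end{pmatrix}.$$

Extracting the top $n-1$ components of this equation we obtain

$$(X_{n-1} - \lambda_i(X_n))\,w + bX = 0.$$

Let $v$ be the unit eigenvector of $X_{n-1}$ corresponding to $\lambda_i(X_{n-1})$. By multiplying with $v^T$, we obtain

$$|b\,v^T X| = |v^T (X_{n-1} - \lambda_i(X_n))\,w| = |\lambda_i(X_{n-1}) - \lambda_i(X_n)|\,|v^T w|.$$

We conclude that, if $\mathcal{E}_i$ holds, then $|b\,v^T X| \le \delta n^{-1/2}$ (by the interlacing law and the bound $|v^T w| \le 1$). If we assume that $\mathcal{G}_{i,n}$ also holds, then we therefore
have

$$|v^T X| \le \delta\,\frac{n^{-1/2}}{T}.$$

If one can choose $T = \Omega(n^{-1/2})$, the RHS is $O(\delta)$, and thus we reduce the problem to bounding the probability
that $X_{n-1}$ has an eigenvector $v$ such that $|v^T X| = O(\delta)$, for which we can apply Theorem 4.1.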


Of course we cannot assume that the last coordinate $b$ of the eigenvector of $\lambda_i(X_n)$ is large. Clearly,
this eigenvector has a coordinate of magnitude at least $n^{-1/2}$, but a trivial union bound argument would cost us a factor
$n$. We can avoid this factor by using Theorem 4.2.


Notice that there is no specific reason to look at the last coordinate. Thus, if we instead look at a random
coordinate (uniformly between 1 and $n$, splitting $X_n$ accordingly), then we have

$$\mathbf{P}(\mathcal{E}_i) \le \frac{n}{N}\,\mathbf{P}\Big(|v^T X| \le \delta\,\frac{n^{-1/2}}{T}\Big) + \mathbf{P}(n_T < N),$$

where $n_T$ is the number of coordinates with absolute values at least $T$; and $v, X$ are the corresponding
eigenvector and column vector.


Now choose $T = c_3 n^{-1/2}$ and $N = c_2 n$. By Theorem 4.2, $\mathbf{P}(n_T < N) \le \exp(-c_1 n)$. Thus, we have

$$\mathbf{P}(\mathcal{E}_i) \le c_2^{-1}\,\mathbf{P}(|v^T X| \le c_3^{-1}\delta) + \exp(-c_1 n). \qquad (5)$$

To bound the RHS, we recall the definition of small ball probability at the beginning of the section. After a
proper rescaling of $\delta$ by a factor $c_3^{-1}$, for any $\varepsilon > 0$

$$\mathbf{P}(|v^T X| \le \delta) \le \mathbf{P}(|v^T X| \le \delta \mid \rho_\delta(v) \le \varepsilon) + \mathbf{P}(\rho_\delta(v) > \varepsilon) \le \varepsilon + \mathbf{P}(\rho_\delta(v) > \varepsilon).$$

Set $\varepsilon := C\delta/\sqrt{\alpha}$ for a sufficiently large constant $C$. Theorem 4.1 then implies that

$$\mathbf{P}(|v^T X| \le \delta) \le C\frac{\delta}{\sqrt{\alpha}} + \exp(-\Omega(n)). \qquad (6)$$

This completes the proof of Theorem 2.1 assuming Theorem 4.1. To prove Theorem 2.4, we will follow the
same strategy, but the analysis is more delicate. We refer the reader to Section 6 for more detail.


For treating the general model $M_n := F_n + X_n$, instead of Theorem 4.1, a similar argument reduces one to
the task of proving the following analogue of Theorem 4.1.

Theorem 4.3. For any $A, \gamma > 0$, there exist $\alpha_0, B > 0$ depending on $A$ and $\gamma$ such that the following holds
with probability $1 - O(\exp(-\alpha_0 n))$: every unit eigenvector $v$ of $M_{n-1} = X_{n-1} + F_{n-1}$ with eigenvalue $\lambda$ of
order $n^{O(1)}$ obeys the anti-concentration estimate

$$\rho_{n^{-B}}(v) = O(n^{-A}).$$


Let us now discuss the proof of Theorem 4.1 and Theorem 4.3. This proof relies on the general theory of
small ball probability. Recent results from inverse Littlewood-Offord theory developed by the second and
third author (see e.g. m, CHI) and by Rudelson and Vershynin (see e.g. [12]) show that if the vector $x$ does
not have rich additive structure, then the small ball probability $\rho_\delta(x)$ is close to $\delta$. Thus, the key ingredient
for proving Theorem 4.1 and Theorem 4.3 is to quantify the following

Heuristic 4.4. With very high probability the eigenvectors of $M_{n-1}$ do not have additive structures.
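
The link between additive structure and small ball probability can be seen in a brute-force experiment. The sketch below computes $\rho_\delta(x) = \sup_a \mathbf{P}(|\xi_1 x_1 + \dots + \xi_n x_n - a| \le \delta)$ for random signs $\xi_i = \pm 1$ by enumerating all $2^n$ sign patterns. It is feasible only for small $n$; the two test vectors and the function name are our own illustrative choices.

```python
import math
from itertools import product


def small_ball_probability(x, delta):
    """rho_delta(x) = sup_a P(|xi_1 x_1 + ... + xi_n x_n - a| <= delta) for iid signs.

    Computed by listing all 2^n sign patterns and sliding a window of
    length 2 * delta (plus a tiny float tolerance) over the sorted sums.
    """
    sums = sorted(sum(s * c for s, c in zip(signs, x))
                  for signs in product((-1.0, 1.0), repeat=len(x)))
    best, left = 0, 0
    for right in range(len(sums)):
        while sums[right] - sums[left] > 2.0 * delta + 1e-12:
            left += 1
        best = max(best, right - left + 1)
    return best / len(sums)


n = 12
flat = [1.0 / math.sqrt(n)] * n                  # strong additive structure
primes = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37]
norm = math.sqrt(sum(primes))
spread = [math.sqrt(p) / norm for p in primes]   # unit vector with less structure

rho_flat = small_ball_probability(flat, 0.01)
rho_spread = small_ball_probability(spread, 0.01)
print(rho_flat, rho_spread)
```

For the flat vector the sum takes only $n + 1$ values, so $\rho_\delta$ equals the largest binomial probability $\binom{12}{6}/2^{12} \approx 0.23$ however small $\delta$ is. The less structured vector gives a markedly smaller value.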


4.1. Notation. Throughout this paper, we regard $n$ as an asymptotic parameter going to infinity (in particular,
we will implicitly assume that $n$ is larger than any fixed constant, as our claims are all trivial for
fixed $n$), and allow all mathematical objects in the paper to implicitly depend on $n$ unless they are explicitly
declared to be "fixed" or "constant". We write $X = O(Y)$, $X \ll Y$, or $Y \gg X$ to denote the claim that
$|X| \le CY$ for some fixed $C$; this fixed quantity $C$ is allowed to depend on other fixed quantities such as
the $(2+\varepsilon_0)$-moment (or the sub-gaussian parameters) of $\xi$ unless explicitly declared otherwise. We also use
$o(Y)$ to denote any quantity bounded in magnitude by $c(n)Y$ for some $c(n)$ that goes to zero as $n \to \infty$.
Again, the function $c(\cdot)$ is permitted to depend on fixed quantities.

For a square matrix $M_n$ and a number $\lambda$, for short we will write $M_n - \lambda$ instead of $M_n - \lambda I_n$. All the norms
in this note, if not specified, will be the usual $\ell_2$-norm.

The rest of this paper is organized as follows. We first give a full proof of Theorem 2.1 and Theorem 2.4 in
Section 5 and Section 6. We then prove Theorem 2.6 in Section 7 and sharpen the estimates for Erdős-Rényi
graphs in Section 8. The note is concluded by an application in Section 9.


5. Consecutive gaps for Wigner matrices: proof of Theorem 4.1

In this section we prove Theorem 4.1, which as discussed previously implies Theorem 2.1. Our treatment in
this section is based on the work by Vershynin in [36]. First of all, we recall the definition of compressible
and incompressible vectors.

Definition 5.1. Let $c_0, c_1 \in (0,1)$ be two numbers (chosen depending on the sub-gaussian moment of $\xi$).
A vector $x \in \mathbf{R}^n$ is called sparse if $|\mathrm{supp}(x)| \le c_0 n$. A vector $x \in S^{n-1}$ is called compressible if $x$ is within
Euclidean distance $c_1$ from the set of all sparse vectors. A vector $x \in S^{n-1}$ is called incompressible if it is
not compressible.

The sets of compressible and incompressible vectors in $S^{n-1}$ will be denoted by $\mathrm{Comp}(c_0, c_1)$ and $\mathrm{Incomp}(c_0, c_1)$
respectively.
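
Definition 5.1 is easy to test on concrete vectors, since the closest vector with at most $k$ nonzero coordinates is obtained by keeping the $k$ largest entries in absolute value. The sketch below uses this observation; the function names and the sample values $c_0 = 1/4$, $c_1 = 1/2$ are ours and chosen only for illustration.

```python
import math


def distance_to_sparse(x, k):
    """Euclidean distance from x to the set of vectors with at most k nonzero entries.

    The nearest such vector keeps the k largest |x_i| and zeroes out the rest,
    so the distance is the norm of the remaining coordinates.
    """
    tail = sorted((abs(c) for c in x), reverse=True)[k:]
    return math.sqrt(sum(c * c for c in tail))


def is_compressible(x, c0, c1):
    """True if the unit vector x lies within distance c1 of the (c0 * n)-sparse vectors."""
    k = math.floor(c0 * len(x))
    return distance_to_sparse(x, k) <= c1


n = 8
basis = [1.0] + [0.0] * (n - 1)   # 1-sparse unit vector
flat = [1.0 / math.sqrt(n)] * n   # all coordinates equal
print(is_compressible(basis, 0.25, 0.5), is_compressible(flat, 0.25, 0.5))
```

With these sample parameters the coordinate vector is compressible, while the flat vector is at distance $\sqrt{3/4}$ from the 2-sparse vectors and hence incompressible.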

5.1. The compressible case. Regarding the behavior of $X_n x$ for compressible vectors, the following was
proved in [36].

Lemma 5.2. [36, Proposition 4.2] There exist positive constants $c_0, c_1$ and $\alpha_0$ such that the following holds
for any $\lambda_0$ of order $O(\sqrt{n})$. For any fixed $u \in \mathbf{R}^n$ one has

$$\mathbf{P}\Big(\inf_{x \in \mathrm{Comp}(c_0,c_1)} \|(X_n - \lambda_0)x - u\| \ll \sqrt{n}\Big) = O(\exp(-\alpha_0 n)).$$

We also refer the reader to Subsection 7.1 for a detailed proof of a similar statement. As a corollary, we
deduce that eigenvectors are not compressible with extremely high probability.


Lemma 5.3. There exist positive constants $c_0, c_1$ and $\alpha_0$ such that

$$\mathbf{P}(\exists \text{ a unit eigenvector } v \in \mathrm{Comp}(c_0,c_1)) = O(\exp(-\alpha_0 n)).$$

Proof of Lemma 5.3. Let $\lambda$ denote the eigenvalue of $v$. Assuming (3), we can find $\lambda_0$ as a multiple of $n^{-2}$ inside $[-10\sqrt{n}, 10\sqrt{n}]$ such that
$|\lambda - \lambda_0| \le n^{-2}$. Hence

$$\|(X_n - \lambda_0)v\| = \|(\lambda - \lambda_0)v\| \le n^{-2}.$$

On the other hand, for each fixed $\lambda_0$, by Lemma 5.2,

$$\mathbf{P}(\exists v \in \mathrm{Comp}(c_0,c_1) : \|(X_n - \lambda_0)v\| \le n^{-2}) = O(\exp(-\alpha_0 n)).$$

The claim follows by a union bound with respect to $\lambda_0$.

□

5.2. The incompressible case. Our next focus is on incompressible vectors. This is the treatment where
the inverse Littlewood-Offord ideas come into play.

We first introduce the notion of least common denominator by Rudelson and Vershynin (see [22]). Fix
parameters $\kappa$ and $\gamma$ (which may depend on $n$), where $\gamma \in (0,1)$. For any nonzero vector $x$ define

$$\mathrm{LCD}_{\kappa,\gamma}(x) := \inf\big\{\theta > 0 : \mathrm{dist}(\theta x, \mathbf{Z}^n) < \min(\gamma\|\theta x\|, \kappa)\big\}.$$

Remark 5.4. In application, we will choose $\kappa = n^{\kappa_0}$ for some sufficiently small $\kappa_0$. Regarding the parameter
$\gamma$, it suffices to set it to be 1/2 for this section; but we will choose it to be proportional to $\alpha$ (from Theorem
2.1) in the next section, hence $\gamma$ can decrease to zero together with $n$. The requirement that the distance
is smaller than $\gamma\|\theta x\|$ forces us to consider only non-trivial integer points as approximations of $\theta x$. The
inequality $\mathrm{dist}(\theta x, \mathbf{Z}^n) < \kappa$ then yields that most coordinates of $\theta x$ are within a small distance from non-zero
integers.
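
The definition of $\mathrm{LCD}_{\kappa,\gamma}(x)$ as $\inf\{\theta > 0 : \mathrm{dist}(\theta x, \mathbf{Z}^n) < \min(\gamma\|\theta x\|, \kappa)\}$ can be explored numerically on small examples. The sketch below replaces the infimum by a search over a uniform grid of $\theta$ values, which is our own simplification: it approximates the infimum from above, up to the grid step. The names `lcd_grid`, `theta_max` and `step` are hypothetical.

```python
import math


def dist_to_integers(y):
    """Euclidean distance from y to the integer lattice Z^n."""
    return math.sqrt(sum((c - round(c)) ** 2 for c in y))


def lcd_grid(x, kappa, gamma, theta_max=50.0, step=1e-3):
    """Grid approximation of inf{theta > 0 : dist(theta x, Z^n) < min(gamma ||theta x||, kappa)}.

    Returns the first grid point satisfying the condition, or math.inf if
    none is found up to theta_max.
    """
    norm_x = math.sqrt(sum(c * c for c in x))
    k = 1
    while k * step <= theta_max:
        theta = k * step
        if dist_to_integers([theta * c for c in x]) < min(gamma * theta * norm_x, kappa):
            return theta
        k += 1
    return math.inf


theta = lcd_grid([0.5, 0.5, 0.5, 0.5], kappa=10.0, gamma=0.5)
print(theta)
```

For $x = (1/2, 1/2, 1/2, 1/2)$ and $\gamma = 1/2$, a direct calculation gives $\mathrm{dist}(\theta x, \mathbf{Z}^4) = 2 - \theta$ for $1 < \theta < 2$, so the condition first holds just above $\theta = 4/3$. The grid search returns the first grid point past that value.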

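Theorem 5.5 (Small ball probability via LCD). [22] Let $\xi$ be a sub-gaussian random variable of mean zero
and variance one, and let $\xi_1, \dots, \xi_n$ be iid copies of $\xi$. Consider a vector $x \in \mathbf{R}^n$ which satisfies $\|x\| \ge 1$.
Then, for every $\kappa > 0$ and $\gamma \in (0,1)$, and for

$$\varepsilon \ge \frac{1}{\mathrm{LCD}_{\kappa,\gamma}(x)},$$

we have

$$\rho_\varepsilon(x) = O\Big(\frac{\varepsilon}{\gamma} + e^{-\Theta(\kappa^2)}\Big),$$

where the implied constants depend on $\xi$.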


To deal with symmetric or Hermitian matrices, it is more convenient to work with the so-called regularized
least common denominator. Let $x = (x_1, \dots, x_n) \in S^{n-1}$. Let $c_0, c_1 \in (0,1)$ be given constants, and assume
$x \in \mathrm{Incomp}(c_0, c_1)$. It is not hard to see that there are at least $c_0 c_1^2 n/2$ coordinates $x_k$ of $x$ that satisfy

$$\frac{c_1}{\sqrt{2n}} \le |x_k| \le \frac{1}{\sqrt{c_0 n}}. \qquad (7)$$

Thus for every $x \in \mathrm{Incomp}(c_0, c_1)$ we can assign a subset $\mathrm{spread}(x) \subset [n]$ such that (7) holds for all
$k \in \mathrm{spread}(x)$ and

$$|\mathrm{spread}(x)| = \lceil c'n \rceil,$$

where we set

$$c' := c_0 c_1^2/4. \qquad (8)$$

Definition 5.6 (Regularized LCD, see also [36]). Let $\alpha \in (0, c'/4)$. We define the regularized LCD of a
vector $x \in \mathrm{Incomp}(c_0, c_1)$ as

$$\widehat{\mathrm{LCD}}_{\kappa,\gamma}(x, \alpha) = \max\big\{\mathrm{LCD}_{\kappa,\gamma}(x_I/\|x_I\|) : I \subseteq \mathrm{spread}(x), |I| = \lceil \alpha n \rceil\big\}.$$

Roughly speaking, the reason we choose to work with $\widehat{\mathrm{LCD}}$ is that we want to detect structure of $x$ in
sufficiently small segments. From the definition, it is clear that if $\mathrm{LCD}(x)$ is small (i.e. when $x$ has strong
structure), then so is $\widehat{\mathrm{LCD}}(x, \alpha)$.

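Lemma 5.7. For any $x \in S^{n-1}$ and any $0 < \gamma < c_1\sqrt{\alpha}/2$, we have

$$\widehat{\mathrm{LCD}}_{\kappa,\gamma(c_1\sqrt{\alpha}/2)^{-1}}(x, \alpha) \le \frac{1}{c_0}\sqrt{\alpha}\,\mathrm{LCD}_{\kappa,\gamma}(x).$$

Consequently, for any $0 < \gamma < 1$,

$$\widehat{\mathrm{LCD}}_{\kappa,\gamma}(x, \alpha) \le \frac{1}{c_0}\sqrt{\alpha}\,\mathrm{LCD}_{\kappa,\gamma(c_1\sqrt{\alpha}/2)}(x).$$

Proof of Lemma 5.7. Note that for any $I \subset \mathrm{spread}(x)$ with $|I| = \lceil \alpha n \rceil$,

$$\frac{c_1}{2}\sqrt{\alpha} \le \|x_I\| \le \frac{1}{\sqrt{c_0}}\sqrt{\alpha}.$$

Assume that $\mathrm{dist}(tx, \mathbf{Z}^n) < \min(\gamma\|tx\|_2, \kappa)$ for some $t \approx \mathrm{LCD}_{\kappa,\gamma}(x)$. Define

$$t_I := t\|x_I\|.$$

One then has

$$\frac{c_1}{2}\sqrt{\alpha}\,t \le t_I \le \frac{1}{\sqrt{c_0}}\sqrt{\alpha}\,t.$$

Furthermore,

$$\mathrm{dist}(t_I x_I/\|x_I\|, \mathbf{Z}^I) \le \mathrm{dist}(tx, \mathbf{Z}^n) < \min(\gamma\|tx\|, \kappa) \le \min\big(\gamma(c_1\sqrt{\alpha}/2)^{-1}\|t_I(x_I/\|x_I\|)\|, \kappa\big).$$

Thus

$$\mathrm{LCD}_{\kappa,\gamma(c_1\sqrt{\alpha}/2)^{-1}}(x_I/\|x_I\|) \le t_I \le \frac{1}{\sqrt{c_0}}\sqrt{\alpha}\,t \le \frac{1}{c_0}\sqrt{\alpha}\,\mathrm{LCD}_{\kappa,\gamma}(x).$$

□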


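We now introduce a result connecting the small ball probability with the regularized LCD.

Lemma 5.8. Assume that

$$\varepsilon \ge \frac{1}{c_0}\sqrt{\alpha}\,\big(\widehat{\mathrm{LCD}}_{\kappa,\gamma}(x, \alpha)\big)^{-1}.$$

Then we have

$$\rho_\varepsilon(x) = O\Big(\frac{\varepsilon}{\gamma c_1\sqrt{\alpha}} + e^{-\Theta(\kappa^2)}\Big).$$

Proof of Lemma 5.8. First apply Theorem 5.5 to $x_I/\|x_I\|$ where the $\widehat{\mathrm{LCD}}$ is achieved: for any $\delta \ge (\mathrm{LCD}_{\kappa,\gamma}(x_I/\|x_I\|))^{-1}$,

$$\rho_\delta\Big(\frac{1}{\|x_I\|}S_I\Big) \ll \frac{\delta}{\gamma} + e^{-\Theta(\kappa^2)},$$

where $S_I = \sum_{i \in I} \xi_i x_i$.

Recall that $\frac{c_1}{2}\sqrt{\alpha} \le \|x_I\| \le \frac{1}{\sqrt{c_0}}\sqrt{\alpha}$. Also notice that if $I \subset J \subset [n]$, then

$$\rho_\delta(S_J) \le \rho_\delta(S_I).$$

Thus for any $\varepsilon \ge \frac{1}{c_0}\sqrt{\alpha}\,(\mathrm{LCD}_{\kappa,\gamma}(x_I/\|x_I\|))^{-1}$, one has $\varepsilon/\|x_I\| \ge (\mathrm{LCD}_{\kappa,\gamma}(x_I/\|x_I\|))^{-1}$, and so

$$\rho_\varepsilon(x) \le \rho_\varepsilon(S_I) = \rho_{\varepsilon/\|x_I\|}(S_I/\|x_I\|) \ll \frac{\varepsilon}{\gamma\|x_I\|} + e^{-\Theta(\kappa^2)} \ll \frac{\varepsilon}{\gamma c_1\sqrt{\alpha}} + e^{-\Theta(\kappa^2)}.$$

□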


For given $D, \kappa, \gamma$ and $\alpha$, we denote the set of unit vectors with bounded regularized LCD by

$$T_{D,\kappa,\gamma,\alpha} := \{x \in \mathrm{Incomp}(c_0,c_1) : \widehat{\mathrm{LCD}}_{\kappa,\gamma}(x, \alpha) \le D\}.$$

The following is an analog of [36, Lemma 7.9].

Lemma 5.9. There exist $c > 0, \alpha_0 > 0$ depending on $c_0, c_1$ from Lemma 5.3 such that the following holds
with $\kappa = n^{2c}$ and $\gamma = 1/2$. Let $n^{-c} \le \alpha \le c'/4$, and $1 \le D \le n^{c/\alpha}$. Then for any fixed $u \in \mathbf{R}^n$ and any real
number $\lambda_0$ of order $O(\sqrt{n})$,

$$\mathbf{P}\big(\exists x \in T_{D,\kappa,\gamma,\alpha} : \|(X_n - \lambda_0)x - u\| = o(\beta\sqrt{n})\big) = O(\exp(-\alpha_0 n)),$$

where

$$\beta := \frac{\kappa}{\sqrt{\alpha}D}.$$

We will give a proof of Lemma 5.9 in Appendix B by following [36]. Assuming it for now, we will obtain the
following key lemma.


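Lemma 5.10. With the same assumption as in Lemma 5.9, we have

$$\mathbf{P}\big(\exists \text{ a unit eigenvector } v \in \mathrm{Incomp}(c_0,c_1) : \widehat{\mathrm{LCD}}_{\kappa,\gamma}(v, \alpha) \le n^{c/\alpha}\big) = O(\exp(-\alpha_0 n)).$$

Proof of Lemma 5.10. Set $D = n^{c/\alpha}$ and $\beta = \frac{\kappa}{\sqrt{\alpha}D}$. Assuming (3), we first approximate the eigenvalue $\lambda$ of $v$ by a multiple of
$\beta$, called $\lambda_0$, from the interval $[-10\sqrt{n}, 10\sqrt{n}]$.

As $X_n v = \lambda v$, we have

$$\|(X_n - \lambda_0)v\| = \|(\lambda - \lambda_0)v\| = O(\beta) = o(\beta\sqrt{n}). \qquad (9)$$

On the other hand, by Lemma 5.9, (9) holds with probability $O(\exp(-\alpha_0 n))$ for any fixed $\lambda_0$. Taking a
union bound over the $O(\beta^{-1}\sqrt{n})$ choices of $\lambda_0$, one obtains

$$\mathbf{P}\big(\exists v \in T_{D,\kappa,\gamma,\alpha}, \exists \lambda_0 \in \beta\mathbf{Z} \cap [-10\sqrt{n}, 10\sqrt{n}] : \|(X_n - \lambda_0)v\| = o(\beta\sqrt{n})\big) \ll \exp(-\alpha_0 n) \times \beta^{-1}\sqrt{n} \ll \exp(-\alpha_0 n/2).$$

□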


Putting Lemma 5.3 and Lemma 5.10 together, we obtain a realization of Heuristic 4.4 as follows.

Theorem 5.11. There exist positive constants $c$ and $\alpha_0$ depending on $c_0, c_1$ from Lemma 5.3 such that
for $\kappa = n^{2c}$, $\gamma = 1/2$ and for $n^{-c} \le \alpha \le c'/4$,

$$\mathbf{P}\big(\text{all unit eigenvectors } v \text{ belong to } \mathrm{Incomp}(c_0, c_1) \text{ and } \widehat{\mathrm{LCD}}_{\kappa,\gamma}(v, \alpha) \gg n^{c/\alpha}\big) = 1 - O(\exp(-\alpha_0 n/2)).$$


We can now complete the proof of Theorem 4.1. By the theorem above, it is safe to assume $\widehat{\mathrm{LCD}}_{\kappa,\gamma}(v, \alpha) \gg n^{c/\alpha}$
for all eigenvectors of $X_{n-1}$. As such, by Lemma 5.8, for any $\delta \gg \sqrt{\alpha}\,n^{-c/\alpha}$ we have

$$\rho_\delta(v) \ll \frac{\delta}{\sqrt{\alpha}} + e^{-\Theta(\kappa^2)} \ll \frac{\delta}{\sqrt{\alpha}}, \qquad (10)$$

where we recall that $\kappa = n^{2c}$.

Theorem 2.1 then follows from (5) and Theorem 4.1, where we note that the condition $\delta \ge n^{-c/\alpha}$ automatically
implies $\delta \gg \sqrt{\alpha}\,n^{-c/\alpha}$.

Remark 5.12. We could have proved Theorem 2.1 by directly applying the results from [36], where LCD
and $\widehat{\mathrm{LCD}}$ were defined slightly differently. However, the current forms of LCD and $\widehat{\mathrm{LCD}}$ are easy to extend
to higher dimensions, which will be useful for our proof of Theorem 2.4 next.


6. Proof of Theorem 2.4


Let us first give a full treatment for the case $l = 2$. Here $c_l = 3$ and we are considering $\lambda_{i+2} - \lambda_i$. The
general case will be deduced with some minor modifications.


6.1. Treatment for $l = 2$. First of all, we will introduce the extension of LCD to higher dimension, following Rudelson and Vershynin (see [23]). Consider two unit vectors $x_1 = (x_{11},\dots,x_{1n})$, $x_2 = (x_{21},\dots,x_{2n}) \in \mathbb{R}^n$. Although the following definition can be extended to more general $x_1, x_2$, let us assume them to be orthogonal. Let $H_{x_1,x_2} \subset \mathbb{R}^n$ be the subspace generated by $x_1, x_2$. Then for $\kappa > 0$ and $\gamma \in (0,1)$ we define

\[ \mathrm{LCD}_{\kappa,\gamma}(x_1, x_2) := \inf_{x \in H_{x_1,x_2},\ \|x\| = 1} \mathrm{LCD}_{\kappa,\gamma}(x). \]


Similarly to Theorem 5.5, the following result gives a bound on the small ball probability for the $\mathbb{R}^2$-random sum $S = \sum_{i=1}^n \xi_i (x_{1i}, x_{2i})$ in terms of the joint structure of $x_1$ and $x_2$.


Theorem 6.1. [23] Let $\xi$ be a sub-gaussian random variable of mean zero and variance one, and let $\xi_1,\dots,\xi_n$ be iid copies of $\xi$. Then, for every $\kappa > 0$ and $\gamma \in (0,1)$, and for

\[ \varepsilon \ge \frac{1}{\mathrm{LCD}_{\kappa,\gamma}(x_1, x_2)}, \]


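we have

\[ \sup_{a \in \mathbb{R}^2} \mathbf{P}\big(\|S - a\|_2 \le \varepsilon\big) \ll \Big(\frac{\varepsilon}{\gamma}\Big)^2 + e^{-\Theta(\kappa^2)}, \]

where the implied constants depend on $\xi$.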


This theorem plays a crucial role in our task of obtaining the repulsion. 

We are now ready to present the main idea of the proof of Theorem 2.4. Let $1 \le i \le n-2$, and consider the event that $X_n$ has three eigenvalues $\lambda_i(X_n) \le \lambda_{i+1}(X_n) \le \lambda_{i+2}(X_n)$ with $\lambda_{i+2} - \lambda_i \le \delta/n^{1/2}$ for some $\delta$.

For short, we set

\[ t := \delta/n^{1/2}. \]

Write

\[ X_n = \begin{pmatrix} X_{n-1} & X \\ X^* & x_{nn} \end{pmatrix} \tag{11} \]

for an $(n-1) \times (n-1)$ minor $X_{n-1}$ and a column vector $X$.

From the Cauchy interlacing law, we observe that

\[ \lambda_i(X_n) \le \lambda_i(X_{n-1}) \le \lambda_{i+1}(X_n) \le \lambda_{i+1}(X_{n-1}) \le \lambda_{i+2}(X_n). \]

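By definition, for some eigenvector $(w, b)$ with $|b| \ge n^{-1/2}$,

\[ \begin{pmatrix} X_{n-1} & X \\ X^* & x_{nn} \end{pmatrix} \begin{pmatrix} w \\ b \end{pmatrix} = \lambda_i(X_n) \begin{pmatrix} w \\ b \end{pmatrix}. \]

Extracting the top $n-1$ components of this equation we obtain

\[ (X_{n-1} - \lambda_i(X_n))w + bX = 0. \]

If we choose unit eigenvectors $v = (v_1,\dots,v_{n-1})^T$ of $X_{n-1}$ with eigenvalue $\lambda_i(X_{n-1})$, and $v' = (v'_1,\dots,v'_{n-1})^T$ of $X_{n-1}$ with eigenvalue $\lambda_{i+1}(X_{n-1})$ respectively, then

\[ |b v^T X| = |v^T (X_{n-1} - \lambda_i(X_n)) w| = |\lambda_i(X_{n-1}) - \lambda_i(X_n)|\,|v^T w| \le t \]

and similarly

\[ |b v'^T X| = |v'^T (X_{n-1} - \lambda_i(X_n)) w| = |\lambda_{i+1}(X_{n-1}) - \lambda_i(X_n)|\,|v'^T w| \le t. \]

In summary, we have

\[ |v^T X| = O(\delta) \ \wedge\ |v'^T X| = O(\delta). \tag{12} \]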


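Let $\mathcal{F}_i$ denote this event. By Theorem 6.1, if we can show $\mathrm{LCD}_{\kappa,\gamma}(v, v')$ large, then $\mathbf{P}(\mathcal{F}_i)$ can be estimated quite efficiently. In what follows we will focus on $\inf_{u \in H_{v,v'}} \mathrm{LCD}_{\kappa,\gamma}(u)$ by first studying $\inf_{u \in H_{v,v'}} \widehat{\mathrm{LCD}}_{\kappa,\gamma}(u, \alpha)$, and then using Lemma 5.7 to pass back to LCD.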


We start with a simple fact first.

Fact 6.2. For any unit vector $u$ from $H_{v,v'}$, we have

\[ \|(X_{n-1} - \lambda_i(X_{n-1}))^2 u\| \le t^2. \]

In particular, by the Cauchy-Schwarz inequality

\[ \|(X_{n-1} - \lambda_i(X_{n-1}))u\| \le t. \tag{13} \]

Proof of Fact 6.2. Assume that $u = a v + a' v'$ with $a^2 + a'^2 = 1$, then

\[ \|(X_{n-1} - \lambda_i(X_{n-1}))^2 u\| = \|(X_{n-1} - \lambda_i(X_{n-1}))(X_{n-1} - \lambda_i(X_{n-1}))u\| \]
\[ = \|a'(X_{n-1} - \lambda_i(X_{n-1}))(X_{n-1} - \lambda_i(X_{n-1}))v'\| \]
\[ = |a'(\lambda_{i+1}(X_{n-1}) - \lambda_i(X_{n-1}))^2| \]
\[ \le t^2. \]


□ 
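Fact 6.2 can be sanity-checked on a small example. The sketch below builds a $2 \times 2$ symmetric matrix with prescribed orthonormal eigenvectors $v, v'$ and eigenvalue gap exactly $t$ (the extreme case allowed by the hypothesis), and returns both norms appearing in the fact. The construction, function names and sample parameters are illustrative assumptions.

```python
import math

def matvec(m, x):
    """Matrix-vector product for small dense matrices stored as lists of rows."""
    return [sum(m[i][j] * x[j] for j in range(len(x))) for i in range(len(m))]

def norm(x):
    """Euclidean norm."""
    return math.sqrt(sum(c * c for c in x))

def fact_6_2_residuals(lam1, lam2, theta, phi):
    """Return (||(X - lam1)^2 u||, ||(X - lam1) u||) for u = cos(phi) v + sin(phi) v'."""
    c, s = math.cos(theta), math.sin(theta)
    v, v_prime = [c, s], [-s, c]  # orthonormal eigenvectors of X
    # X = lam1 * v v^T + lam2 * v' v'^T has eigenpairs (lam1, v) and (lam2, v').
    X = [[lam1 * v[i] * v[j] + lam2 * v_prime[i] * v_prime[j] for j in range(2)]
         for i in range(2)]
    a, a_prime = math.cos(phi), math.sin(phi)  # a^2 + a'^2 = 1, so u is a unit vector
    u = [a * v[i] + a_prime * v_prime[i] for i in range(2)]
    B = [[X[i][j] - (lam1 if i == j else 0.0) for j in range(2)] for i in range(2)]
    Bu = matvec(B, u)
    return norm(matvec(B, Bu)), norm(Bu)
```

With gap $t = \lambda' - \lambda$, the two returned values equal $|a'| t^2$ and $|a'| t$ up to rounding, matching the computation in the proof.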


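Next, from Lemma 5.3 and Lemma 5.9, and by (13) from Fact 6.2, we infer the following.


Theorem 6.3. There exist positive constants $c, \alpha_0$ such that for $\kappa = n^{2c}$, $\gamma = 1/2$, and for any $n^{-c} \le \alpha \le c'/4$ the following holds for any $t \ge n^{-c/\alpha}\kappa/\sqrt{\alpha}$:

\[ \mathbf{P}\big(X_{n-1} \text{ has two eigenvectors } v, v' \text{ with eigenvalues } |\lambda - \lambda'| \le t \text{ and there exists } u \in H_{v,v'} \cap \mathrm{Incomp}(c_0, c_1) \text{ such that } \widehat{\mathrm{LCD}}_{\kappa,\gamma}(u, \alpha) \le t^{-1}\kappa/\sqrt{\alpha}\big) = O(\exp(-\alpha_0 n/2)). \]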


Proof of Theorem 6.3. Let

\[ D := t^{-1}\kappa/\sqrt{\alpha} \quad\text{and}\quad \beta := t. \]

It is clear that with the given range of $t$ one has $1 \le D \le n^{c/\alpha}$. By approximating $\lambda_i(X_{n-1})$ by $\lambda_0$, a multiple of $\beta$, and also by (13) and by the triangle inequality

\[ \|(X_{n-1} - \lambda_0)u\| \le \|(X_{n-1} - \lambda_i(X_{n-1}))u\| + |\lambda_i(X_{n-1}) - \lambda_0| = O(\beta). \]

To complete the proof, we just apply Lemma 5.9 for these choices of $D$ and $\beta$, and then take the union bound over all $O(\sqrt{n}\beta^{-1})$ choices of $\lambda_0$. □


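Let $\mathcal{E}_i$ be the event that $X_{n-1}$ has eigenvectors $v, v'$ with eigenvalues $|\lambda_{i+1}(X_{n-1}) - \lambda_i(X_{n-1})| \le t$, and $\mathcal{E}_i \wedge \mathcal{H}_i$ be the event $\mathcal{E}_i$ coupled with $\inf_{u \in H_{v,v'}} \widehat{\mathrm{LCD}}_{\kappa,\gamma}(u, \alpha) \gg t^{-1}\kappa/\sqrt{\alpha}$. We have learned from Theorem 6.3 that

\[ \mathbf{P}(\mathcal{E}_i \wedge \mathcal{H}_i) = \mathbf{P}(\mathcal{E}_i) - O(\exp(-\alpha_0 n/2)). \]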

On the other hand, our main result of the previous section, (10), implies that P(£i) = O ■ Thus 


P(£.AH.)<4=- (14) 

Now, condition on A for every u € we have LCD,; ,y(u,a) > It then follows from 

Lemma 15.71 that 


LCD«,.y(M,a) < 

We have thus obtained the key estimate 


LCD ^/2(i;,n')= inf LCD,, ^/2 (m) » t 


(15) 


Now we estimate P(Jy|i£’i A T-Li) (where we recall Jy from in)- As (5 = ^ (t ^n/a) and as Ci ,7 

are all fixed constants, by Theorem |6.1[ 


V{H£, A n,) « 


2 / J ^ 2 


(16) 


Combining (16) with (14), we obtain that 


P(J'i A fi) < ( + exp(-0(n)) < 


(17) 


Together with ([^, this completes the proof of the case 1 = 2 in Theorem 2.4 where the condition 5 >v} 
automatically implies the requirement t > of Theorem 6.3 as long as c is sufficiently small. 


6.2. Proof sketch for general $l$. By the definition of $d$, we have $2^d \le l < 2^{d+1}$, and that

\[ \lambda_i(X_n) \le \lambda_i(X_{n-1}) \le \cdots \le \lambda_{i+l-1}(X_{n-1}) \le \lambda_{i+l}(X_n). \]

Let $v_1,\dots,v_l$ be the eigenvectors of $X_{n-1}$ corresponding to $\lambda_i(X_{n-1}),\dots,\lambda_{i+l-1}(X_{n-1})$. Then similarly to (12), with $X$ being the last column of $X_n$ in (11), we have the conjunction

\[ (|v_1^T X| = O(\delta)) \wedge \cdots \wedge (|v_{2^d}^T X| = O(\delta)). \]
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Also, as an analog of Fact 6.2 for any u G H,, 




and for t = we have 




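So by Cauchy-Schwarz,

\[ \|(X_{n-1} - \lambda_i(X_{n-1}))u\| \le t. \tag{18} \]

Additionally, arguing similarly to the proof of Theorem 6.3, if we let $\mathcal{E}'_i$ be the event that $X_{n-1}$ has eigenvectors $v_1,\dots,v_l$ with eigenvalues $|\lambda_{i+l-1}(X_{n-1}) - \lambda_i(X_{n-1})| \le t$, and $\mathcal{E}'_i \wedge \mathcal{H}'_i$ be the event $\mathcal{E}'_i$ coupled with $\widehat{\mathrm{LCD}}_{\kappa,\gamma}(u, \alpha) \gg t^{-1}\kappa/\sqrt{\alpha}$, then

\[ \mathbf{P}(\mathcal{E}'_i \wedge \mathcal{H}'_i) = \mathbf{P}(\mathcal{E}'_i) - O(\exp(-\alpha_0 n/2)). \]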


To this end, conditioning on $\mathcal{E}'_i \wedge \mathcal{H}'_i$, it follows from (18), together with the small probability bound from Theorem 6.1 (applied to $2^d$ vectors), that

Px {Iv^Xl = 0(6)) A • • • A i\v'^dX\ = 0{5))\£^ A Ui) < 


By iterating this conditional process up to $X_{n-l}$ and proceeding similarly to (17), we obtain the following lower bound for the exponent of $\delta$:

\[ \sum_{k=0}^{d-1} 4^k + (l - 2^d + 1)2^d = \frac{(3l + 3 - 2^{d+1})2^d - 1}{3}, \]

where we used the fact that the contribution to the exponent for the running index $l'$ from $2^d$ to $l$ is $(l - 2^d + 1)2^d$, and for $l'$ varying from $2^k$ to $2^{k+1} - 1$ (with $0 \le k \le d-1$) is $(2^k)(2^k) = 4^k$.
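The identity between the block-by-block sum and the closed form for the exponent can be verified mechanically. The sketch below evaluates both sides for a given $l$, with $d = \lfloor \log_2 l \rfloor$ so that $2^d \le l < 2^{d+1}$. The function names are illustrative assumptions.

```python
def exponent_by_summation(l: int) -> int:
    """Sum of the contributions 4^k for k < d, plus the partial block (l - 2^d + 1) * 2^d."""
    d = l.bit_length() - 1  # d = floor(log2(l)), so 2**d <= l < 2**(d+1)
    full_blocks = sum(4 ** k for k in range(d))
    partial_block = (l - 2 ** d + 1) * 2 ** d
    return full_blocks + partial_block

def exponent_closed_form(l: int) -> int:
    """Closed form ((3l + 3 - 2^(d+1)) * 2^d - 1) / 3 for the same quantity."""
    d = l.bit_length() - 1
    numerator = (3 * l + 3 - 2 ** (d + 1)) * 2 ** d - 1
    return numerator // 3  # the numerator is always divisible by 3
```

For $l = 1$ both give $1$ and for $l = 2$ both give $3$, matching the exponents in the cases treated above.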


6.3. Remark. We finish this section with a short discussion on how to obtain a version of Theorem 2.1 for complex Wigner matrices with a probability bound of $O((\delta/\sqrt{\alpha})^2)$.


First of all, let $X_n$ be a random Hermitian matrix where the real and imaginary components of the off-diagonal terms are iid copies of a sub-gaussian random variable of mean zero and variance $1/2$.


An identical proof of Theorem 5.11 then implies that with probability $1 - \exp(-\Theta(n))$, for any unit complex eigenvector $v$ of $X_n$ (including all complex phases $e^{i\theta}v$, $0 \le \theta \le 2\pi$), the vector $(\Re(v), \Im(v))$ belongs to $\mathrm{Incomp}(c_0, c_1)$ and

\[ \widehat{\mathrm{LCD}}_{\kappa,\gamma}(\Re(v), \Im(v), \alpha) \gg n^{c/\alpha}. \tag{19} \]
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Next, by (§, P(£i) is controlled by P(|r;^X| = 0(i5)), where v is any unit eigenvector associated to Xi. For 
short, write $w = (\Re(v), \Im(v))$, $w' = (\Im(v), -\Re(v))$ and $Y = (\Re(X), \Im(X))$; we then have

\[ \mathbf{P}(|v^T X| = O(\delta)) = \mathbf{P}\big(|w^T Y| = O(\delta) \wedge |w'^T Y| = O(\delta)\big). \tag{20} \]


Thus we are in the position to apply Theorem 6.1. For this, we need to verify that the subspace $H_{w,w'}$ does not contain any unit vector with small $\widehat{\mathrm{LCD}}$. However, this is exactly what we have obtained in (19) (see also the proof of Theorem 6.3).


In conclusion, with $\delta \ge n^{-c/\alpha}$, the bound of Theorem 6.1 then implies

\[ \mathbf{P}(\mathcal{E}_i) \ll \Big(\frac{\delta}{\sqrt{\alpha}}\Big)^2. \]


For the more general Theorem 2.4, it looks plausible that we can double the value of $c_l$, using a similar, but more tedious argument. We skip the details.


7. Consecutive gaps for perturbed matrices: proof of Theorem 2.6


In the perturbed case $M_n = F_n + X_n$, the approach used in Section 5 and Section 6 does not seem to work. This is mainly because the norm of the matrix now blows up, and also because $\xi$ is assumed to have bounded $(2 + \varepsilon_0)$-moment only. To handle this case, we will rely instead on the inverse Littlewood-Offord results developed by the second and third author.

It suffices to justify Theorem 4.3. As we have seen, it is technically convenient to replace the concentration probability $\rho_\delta(v)$ with a segmental variant $\rho_{\delta,\alpha}(v)$, and to work with two related scales $\delta$ rather than a single scale $\delta$.


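More precisely, for any $\delta > 0$, $0 < \alpha \le 1$ and any $v \in \mathbb{R}^n$, let $\rho_{\delta,\alpha}(v)$ denote the quantity

\[ \rho_{\delta,\alpha}(v) := \inf_{I \subset \{1,\dots,n\}:\ |I| = \lfloor \alpha n \rfloor} \rho_\delta(v|_I), \]

where $v|_I = (v_{i_1},\dots,v_{i_m})$ is the restriction of $v$ to $I = \{i_1,\dots,i_m\}$ with $i_1 < \cdots < i_m$.


We observe the easy inequality

\[ \rho_\delta(v) \le \rho_{\delta,\alpha}(v). \tag{21} \]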
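Inequality (21) can be checked exactly on small examples. The sketch below assumes, purely for illustration, that $\xi$ is a uniform random sign and that the coefficients and the radius are integers, so that all sums are computed exactly by enumerating the $2^n$ sign patterns. The function names `small_ball` and `segmental_small_ball` are hypothetical.

```python
from itertools import combinations, product

def small_ball(v, delta):
    """rho_delta(v) = sup_a P(|sum_i xi_i v_i - a| <= delta) for iid uniform signs xi_i."""
    sums = sorted(sum(s * x for s, x in zip(signs, v))
                  for signs in product((-1, 1), repeat=len(v)))
    best, right = 0, 0
    for left in range(len(sums)):
        # Slide the closed window [sums[left], sums[left] + 2*delta] over the sorted sums.
        while right < len(sums) and sums[right] - sums[left] <= 2 * delta:
            right += 1
        best = max(best, right - left)
    return best / len(sums)

def segmental_small_ball(v, delta, m):
    """Infimum of rho_delta(v|_I) over all index sets I of size m."""
    return min(small_ball([v[i] for i in I], delta)
               for I in combinations(range(len(v)), m))
```

For instance, `small_ball([1, 1, 1, 1], 0)` is $6/16$, while every restriction to two coordinates has small ball probability $1/2$, consistent with (21).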


We will assume that the matrix $M_n$ has operator norm bounded by $n^{\gamma'}$, for some $\gamma' \ge \max\{\gamma, 1/2\}$, with probability at least $1 - O(n^{-A})$. Thus for instance if $\xi$ has subgaussian distribution, then one can take $\gamma' = \max\{\gamma, 1/2\}$. (One can also see that $\gamma' \le A/2 + \gamma + 4$ in any case.)


For the rest of the treatment of Theorem 4.3 we will choose

\[ \alpha := \frac{1}{n^{\varepsilon}} \tag{22} \]

for some sufficiently small constant $\varepsilon$ depending on $A$ and $\gamma'$.

We now reduce Theorem 4.3 to

Theorem 7.1. With probability at least $1 - O(\exp(-\alpha_0 n))$ for some positive constant $\alpha_0$ independent of $n$, there is no unit eigenvector $v$ of $M_n$ with an eigenvalue $\lambda$ of order $O(n^{\gamma'})$ with the following property: there is a radius $\delta$ with $n^{-B} \le \delta \le n^{-B/2}$ such that




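To deduce Theorem 4.3 we assume otherwise that $\rho_{n^{-B}}(v) \ge n^{-A}$. Define the sequence $(\delta_j)_{j=0}^{J}$ with $J := \lfloor A/0.49 \rfloor$, $\delta_0 := n^{-B}$, and

\[ \delta_{j+1} := n^{\gamma'}\delta_j \]

for $0 \le j < J$. By assumption and (21), $\rho_{\delta_0,\alpha}(v) \ge n^{-A}$, and if we choose $B$ so that

\[ B \ge 2J\gamma', \tag{23} \]

then because the $\rho$'s are bounded by one, there exists $j < J$ (and so $n^{-B} \le \delta_{j+1} \le n^{-B/2}$) such that

\[ \rho_{\delta_{j+1},\alpha}(v) \le n^{0.49}\rho_{\delta_j,\alpha}(v). \]

It remains to establish Theorem 7.1; this is the objective of the rest of this section.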


7.1. The compressible case. Let us first consider the easy case in which there exists $n^{-B} \le \delta \le n^{-B/2}$ such that $\rho_{\delta,\alpha}(v) \ge (\alpha n)^{-1/2+\varepsilon}$. This case is similar to our treatment of Subsection 5.1. Our main tool will be the following Erdős-type inverse Littlewood-Offord theorem:


Theorem 7.2. [10] Let $\varepsilon > 0$ be fixed, let $\delta > 0$, and let $v \in \mathbb{R}^m$ be a unit vector with

\[ \rho_\delta(v) \ge m^{-1/2+\varepsilon}. \]

Then all but at most $\varepsilon m$ of the coefficients of $v$ have magnitude at most $\delta$.


Now as $q = \rho_{\delta,\alpha}(v) \ge (\alpha n)^{-1/2+\varepsilon}$, by Theorem 7.2 we see that for every $I \subset \{1,\dots,n\}$ with $|I| = \lfloor \alpha n \rfloor$, all but at most $O(\varepsilon|I|)$ of the coefficients of $v|_I$ have magnitude at most $\delta$, where the implied constant does not depend on $\varepsilon$. By a simple covering argument, we conclude that $|v_i| \le \delta$ for all $i$ outside of an exceptional set $S \subset \{1,\dots,n\}$ of cardinality $|S| = O(\varepsilon\alpha n) = O(n^{1-\varepsilon})$.

If we let $\mathcal{E}_S$ denote the event that the above situation holds for a given value of $S$, the probability that the conclusion of Theorem 7.1 fails may thus be bounded via the union bound by

\[ \sum_{S \subset \{1,\dots,n\}:\ |S| = O(\alpha n)} \mathbf{P}(\mathcal{E}_S) + O(\exp(-\Theta(n))). \]

By Stirling’s formula, the number of possible exceptional sets S is at most Thus it suffices to 

show that

\[ \mathbf{P}(\mathcal{E}_S) \le \exp(-\alpha_0 n) \tag{24} \]

uniformly in $S$ and some fixed $\alpha_0 > 0$ independent of $\varepsilon$.


By symmetry we may take

\[ S = \{n'+1,\dots,n\} \]

for some $n' = (1 - O(\alpha))n$.

Now suppose that the event $\mathcal{E}_S$ holds, and let $v, \lambda$ be as above. We split

\[ M_n = \begin{pmatrix} M_{n'} & D \\ D^* & M_{n-n'} \end{pmatrix} \]

where $M_{n'}, M_{n-n'}$ are the top left $n' \times n'$ and bottom right $(n-n') \times (n-n')$ minors of $M_n$ respectively, and $D$ is an $n' \times (n-n')$ matrix whose entries are independent copies of $\xi$ (modulo the deterministic part from $F_n$), and also independent of $M_{n'}, M_{n-n'}$. We also split $v = (v', v'')^T$ with $v' \in \mathbb{R}^{n'}$ and $v'' \in \mathbb{R}^{n-n'}$.


By hypothesis, all entries of $v'$ have magnitude at most $\delta = O(n^{-B/2})$, and so provided that $B > 1$,


llu'll < ^ 


(25) 


and thus (since u is a unit vector) 


I < lk"ll < 1- (26) 

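From the eigenvector equation

\[ \begin{pmatrix} M_{n'} & D \\ D^* & M_{n-n'} \end{pmatrix} \begin{pmatrix} v' \\ v'' \end{pmatrix} = \lambda \begin{pmatrix} v' \\ v'' \end{pmatrix} \]

we have

\[ (M_{n'} - \lambda)v' + Dv'' = 0. \tag{27} \]

Hence,

\[ \|Dv''\| = O(n^{\gamma'}\delta n^{1/2}) = O(n^{-B/2 + \gamma' + 1/2}). \tag{28} \]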

On the other hand, by a standard epsilon-net argument (noting that $n - n' = O(n^{1-\varepsilon})$), with probability $1 - O(\exp(-\alpha_0 n))$ for some fixed $\alpha_0 > 0$ independent of $\varepsilon$, we have

\[ \inf_{w \in \mathbb{R}^{n-n'}:\ \|w\| = 1} \|Dw\| \gg n^{-1/2}. \tag{29} \]




We hence obtain (24) provided that

\[ B \ge 2\gamma' + 2. \tag{30} \]

7.2. The incompressible case. Now we assume that there exists a radius $\delta$ with $n^{-B} \le \delta \le n^{-B/2}$ such that


n ^ ^ and < (an) 


In this case, Theorem 7.2 is insufficient to control the "rich" vector $v$. Instead, we will rely on the more general inverse Littlewood-Offord theorem from [19] (see also [28], [29]). Define a generalised arithmetic progression (or GAP) to be a finite subset $Q$ of the real line $\mathbb{R}$ of the form

\[ Q = \{a_1 w_1 + \cdots + a_r w_r : a_i \in \mathbb{Z},\ |a_i| \le N_i \text{ for all } i = 1,\dots,r\} \]

where $r \ge 0$ is a natural number (the rank of the GAP), $N_1,\dots,N_r > 0$ are positive integers (the dimensions of the GAP), and $w_1,\dots,w_r \in \mathbb{R}$ are real numbers (the generators of the GAP). We refer to the quantity $\prod_{i=1}^r (2N_i + 1)$ as the volume $\mathrm{vol}(Q)$ of $Q$; this is an upper bound for the cardinality $|Q|$ of $Q$. We then have

Theorem 7.3 (Continuous inverse Littlewood-Offord theorem). Let $\varepsilon > 0$ be fixed, let $\delta > 0$, and let $v \in \mathbb{R}^n$ be a unit vector whose small ball probability $\rho := \rho_\delta(v)$ obeys the lower bound

\[ \rho \gg n^{-O(1)}. \]

Then there exists a generalized arithmetic progression $Q$ of volume

\[ \mathrm{vol}(Q) \le \max\Big(O\Big(\frac{1}{\rho\sqrt{n}}\Big), 1\Big) \]

such that all but at most $\varepsilon n$ of the coefficients $v_1,\dots,v_n$ of $v$ lie within $\delta$ of $Q$. Furthermore, if $r$ denotes the rank of $Q$, then $r = O(1)$, and all the generators $w_1,\dots,w_r$ of $Q$ have magnitude $O(1)$.
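To make the definition of a GAP concrete, the following sketch enumerates the elements of a GAP from its generators and dimensions and computes its volume $\prod_i (2N_i + 1)$; the volume bounds the cardinality, with equality exactly when all the sums are distinct. The function names and the sample generators are illustrative assumptions.

```python
from itertools import product
from math import prod

def gap_elements(generators, dimensions):
    """All sums a_1 w_1 + ... + a_r w_r with integers |a_i| <= N_i, as a set."""
    ranges = [range(-N, N + 1) for N in dimensions]
    return {sum(a * w for a, w in zip(coeffs, generators))
            for coeffs in product(*ranges)}

def gap_volume(dimensions):
    """Volume of the GAP: the product of (2 N_i + 1) over all dimensions N_i."""
    return prod(2 * N + 1 for N in dimensions)
```

With generators $(1, 2)$ and dimensions $(2, 2)$ the GAP is $\{-6,\dots,6\}$, of cardinality 13 against volume 25; with generators $(1, 10)$ and dimensions $(2, 3)$ all 35 sums are distinct.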


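We now begin the proof of Theorem 7.1 with the setting

\[ q := \rho_{\delta,\alpha}(v) < (\alpha n)^{-1/2+\varepsilon}. \]

As $\rho_{n^{\gamma'}\delta,\alpha}(v) \le n^{0.49} q$, there must exist a subset $I$ of $\{1,\dots,n\}$ of cardinality $|I| = \lfloor \alpha n \rfloor$ with

\[ \rho_{n^{\gamma'}\delta}(v|_I) \le n^{0.49} q. \tag{31} \]

For each $I$, let $\mathcal{E}'_I$ be the event that the above situation occurs; thus the conclusion of Theorem 7.1 fails with probability at most

\[ \sum_{I \subset \{1,\dots,n\}:\ |I| = \lfloor \alpha n \rfloor} \mathbf{P}(\mathcal{E}'_I) + O(\exp(-\Theta(n))). \]

We crudely bound the number of possible $I$ by $2^n$. The key estimate is stated below.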
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Lemma 7.4. We have 


?(£■;) 


This will establish Theorem 7.1 in the incompressible case if $\varepsilon$ is chosen small enough.


Our approach to prove Lemma 7.4 is somewhat similar to [32]. First, by symmetry we may assume that $I = \{1,\dots,k\}$, where $k := \lfloor \alpha n \rfloor$. Similarly to the previous section, we split

\[ M_n = \begin{pmatrix} M_k & D \\ D^* & M_{n-k} \end{pmatrix} \]

where $M_k, M_{n-k}$ are the top left $k \times k$ and bottom right $(n-k) \times (n-k)$ minors of $M_n$ respectively, and $D$ is a $k \times (n-k)$ matrix whose entries are (modulo the deterministic part from $F_n$) independent copies of $\xi$, and also independent of $M_k, M_{n-k}$. We also split $v = (v', v'')^T$ with $v' \in \mathbb{R}^k$ and $v'' \in \mathbb{R}^{n-k}$.

Now suppose that $M_n$ is such that the event $\mathcal{E}'_I$ holds; the heart of our analysis is to approximate $(v, \lambda)$ by low-entropy structures.

Lemma 7.5. There exists a subset $\mathcal{N}$ of $\mathbb{R}^n \times \mathbb{R}^k \times \mathbb{R}$ of size $O(n^{-n/2 + O(\alpha n)} q^{-n})$ such that for any eigenvector-eigenvalue pair $(v, \lambda)$ satisfying $\mathcal{E}'_I$, there exists $(\tilde v, w', \tilde\lambda) \in \mathcal{N}$ which well-approximates $(v, \lambda)$ in the following sense:

• (general approximation of eigenvector) $|v_j - \tilde v_j| \le \delta$, for $1 \le j \le n$;

• (finer approximation over segment) $|v'_j - w'_j| \le n^{-\gamma'-1}\delta$ for $1 \le j \le k$;

• (approximation of eigenvalue) $|\lambda - \tilde\lambda| \le n^{-\gamma'-1}\delta$.

We also refer the reader to Lemma B.1 in Appendix B for similarities.


Proof of Lemma 7.5. We cover $\{1,\dots,n\}$ by sets $I_1,\dots,I_m$ of length differing by at most one, with

\[ m \le \frac{1}{\alpha} + 1 = O(n^{\varepsilon}). \]

Because $\rho_{\delta,\alpha}(v) = q$, for each $j = 1,\dots,m$ we have

\[ \rho_\delta(v|_{I_j}) \ge q. \]

Applying Theorem 7.3 and the incompressibility hypothesis $q < (\alpha n)^{-1/2+\varepsilon}$, we may thus find, for each $i = 1,\dots,m$, a GAP $Q_i$ such that

\[ \mathrm{vol}(Q_i) \le (\alpha n)^{-1/2+\varepsilon}/q \tag{32} \]

and rank $r_i = O(1)$ such that all but at most $O(\varepsilon\alpha n)$ of the coefficients of $v|_{I_i}$ lie within a distance $O(\delta)$ of $Q_i$. Thus we have

\[ \mathrm{dist}\Big(v_j, \bigcup_{i=1}^m Q_i\Big) \ll \delta \]

for all $j = 1,\dots,n$ outside of an exceptional set $S \subset \{1,\dots,n\}$ of cardinality

\[ |S| \ll \varepsilon n. \]

Furthermore, all the generators $w_{i,1},\dots,w_{i,r_i}$ of $Q_i$ have magnitude $O(1)$. From (32) we have the crude bound for the dimensions of $Q_i$

\[ N_{i,1},\dots,N_{i,r_i} \le (\alpha n)^{-1/2+\varepsilon}/q. \]

From this, we may round each generator $w_{i,l}$ to the nearest integer multiple of $q\delta$ (say) without loss of generality, since this only moves the elements of $Q_i$ by $O((\alpha n)^{-1/2+\varepsilon}\delta) = O(\delta)$ at most. In particular, all elements of $\bigcup_{i=1}^m Q_i$ are integer multiples of $q\delta$.

Thanks to this information, one can create a coarse "discretized approximation" $\tilde v = (\tilde v_1,\dots,\tilde v_n)$ to $v$, by setting $\tilde v_j$ for $j = 1,\dots,n$ to be the nearest element of $\bigcup_{i=1}^m Q_i$ if $v_j$ lies within $O(\delta)$ of this set, or the nearest multiple of $q\delta$ to $v_j$ otherwise. Then $\tilde v$ consists entirely of multiples of $q\delta$, one has

\[ |v_j - \tilde v_j| \ll \delta \tag{33} \]

for all $j = 1,\dots,n$, and all but at most $|S|$ of the coefficients of $\tilde v$ lie in $\bigcup_{i=1}^m Q_i$.

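We will also need a finer approximation $w' = (w_1,\dots,w_k)^T$ to the component $v' = (v_1,\dots,v_k)^T$ of $v$, by choosing $w_j$ to be the nearest integer multiple of $n^{-\gamma'-1}\delta$ to $v_j$, thus for all $j = 1,\dots,k$,

\[ |v_j - w_j| \le n^{-\gamma'-1}\delta. \tag{34} \]

Similarly, one approximates the eigenvalue $\lambda$ by the nearest multiple $\tilde\lambda$ of $n^{-\gamma'-1}\delta$, thus

\[ |\lambda - \tilde\lambda| \le n^{-\gamma'-1}\delta. \tag{35} \]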
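The discretization steps above all have the same shape: replace a real number by the nearest integer multiple of a fixed step, at a cost of at most half a step, and note that a bounded range contains only $O(1/\text{step})$ such multiples. The sketch below illustrates both points with exact rational arithmetic. The function names and the sample step are illustrative assumptions.

```python
from fractions import Fraction

def nearest_multiple(x: Fraction, step: Fraction) -> Fraction:
    """Nearest integer multiple of step to x (ties resolved by Python's round)."""
    return round(x / step) * step

def count_multiples(step: Fraction, bound: Fraction = Fraction(1)) -> int:
    """Number of integer multiples of step in [-bound, bound]."""
    return 2 * (bound // step) + 1  # floor division of Fractions returns an int
```

For example, with step $1/4$ the value $7/10$ is rounded to $3/4$, and there are 9 candidate values in $[-1, 1]$; this is the kind of count that enters the entropy bound for the discretized data.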

We now claim that the data $\tilde v, w', \tilde\lambda$ have low entropy, in the sense that they take a relatively small number of values. Indeed, the number of possible ranks $r_1,\dots,r_m$ of the $Q_i$ is $(O(1))^m = (O(1))^{1/\alpha}$. Once the ranks are chosen, the number of ways we can choose the generators $w_{i,l}$ of the $Q_i$ (which are all multiples of $q\delta$ of magnitude $O(1)$) are

\[ O\big((1/q\delta)^{\sum_{i=1}^m r_i}\big) = O(n^{O(1/\alpha)}) = O(n^{O(n^{\varepsilon})}) \]

since $q, \delta \ge n^{-O(1)}$ by hypothesis.

The number of sets S may be crudely bounded by 2”. For each j € S, the number of choices for vj is 
0{l/qS), leading to 0((l/g^)l‘^l) = 0{{l/q6)^^°'^^) ways to choose this portion of v. For j ^ S, Vj lies in 
U^iQi, which has cardinality 0{mvol{Q)) = /q). Finally, for the k = \an\ coefficients of w' 

there are at most 0(n^ 1^) choices, and there are similarly 0(n'’' choices for A. 

Thus the total possible number of quadruplet]^ {v, w', A) is at most 


(0(l))i/“ X X 2” X x 0((in-i/2+“/g)") x 0((nT''+Vg)L“"J) x Oin^^^qf, (36) 


which simplifies to

\[ O(n^{-n/2 + O(\alpha n)} q^{-n}). \]

Note that many of the factors in the previous expression (36) can be absorbed into the error as $\varepsilon$ is chosen sufficiently small. □
□ 


Now we assume that $(v, \lambda)$ is an eigenvector-eigenvalue pair satisfying $\mathcal{E}'_I$, which can be approximated by a tuple $(\tilde v, w', \tilde\lambda) \in \mathcal{N}$ as in Lemma 7.5. The lower $(n-k)$-dimensional component of the eigenvector equation $M_n v = \lambda v$ then reads as

\[ D^* v' + (M_{n-k} - \lambda)v'' = 0. \]

From (34) and (35) we certainly have

\[ \|(\lambda - \tilde\lambda)v''\| \le \delta \quad\text{and}\quad \|D^*(v' - w')\| \le \delta. \]

By (33), we have

\[ \|v'' - \tilde v''\| \ll n^{1/2}\delta, \]

where $\tilde v''$ is the lower $n - k$ entries of $\tilde v$.

Since $\lambda$ has order $O(n^{\gamma'})$, we have $\tilde\lambda = O(n^{\gamma'})$, and so

\[ \|(M_{n-k} - \tilde\lambda)(\tilde v'' - v'')\| \ll n^{\gamma'+1/2}\delta. \]

Hence, with $u = -(M_{n-k} - \tilde\lambda)\tilde v''$ independent of $D$,

\[ \|D^* w' - u\| = \|D^*(w' - v') + D^* v' + (M_{n-k} - \lambda)v'' + (M_{n-k} - \tilde\lambda)(\tilde v'' - v'') + (\lambda - \tilde\lambda)v''\| \ll n^{\gamma'+1/2}\delta. \tag{37} \]

(Footnote: One could also expand this set of data by also adding in the GAPs $Q_1,\dots,Q_m$ and the set $S$, but we will have no need of this additional data in the rest of the argument.)


Let us now condition $M_{n-k}$ to be fixed, so that $u$ is deterministic. Let $x_1,\dots,x_{n-k} \in \mathbb{R}^k$ denote the rows of $D^*$; then the $x_1,\dots,x_{n-k}$ are independent vectors, each of whose elements (up to a deterministic part from $F_n$) is an independent copy of $\xi$. The bound (37) then can be rewritten as

\[ \sum_{i=1}^{n-k} |x_i^T w' - u_i|^2 = O(n^{2\gamma'+1}\delta^2) \tag{38} \]

where $u_1,\dots,u_{n-k} \in \mathbb{R}$ are the coefficients of $u$.


In summary, if we let $\mathcal{E}_{\tilde v, w', \tilde\lambda}$ be the event that the above situation holds for a given choice of $\tilde v, w', \tilde\lambda$, it will suffice by the union bound (taking into account the cardinality of $\mathcal{N}$ from Lemma 7.5) to establish the following upper bound.

Lemma 7.6. For any $(\tilde v, w', \tilde\lambda) \in \mathcal{N}$,

\[ \mathbf{P}(\mathcal{E}_{\tilde v, w', \tilde\lambda}) = O\big((n^{0.49 + O(\alpha)} q)^n\big). \tag{39} \]


To justify (39), we first recall from (31) that $\rho_{n^{\gamma'}\delta}(v') \le n^{0.49} q$. From (34), the random walks (with the $x_i$) associated to $v'$ and $w'$ differ by at most $O(n^{-1/2}\delta)$, and so

\[ \rho_{n^{\gamma'}\delta/2}(w') \le n^{0.49} q. \]

We now invoke the following tensorization trick (which is not strictly necessary here, but will be useful later).

Lemma 7.7. [22, Lemma 2.2] Let $\zeta_1,\dots,\zeta_n$ be independent non-negative random variables, and let $K, t_0 > 0$. If one has

\[ \mathbf{P}(\zeta_k < t) \le Kt \]

for all $k = 1,\dots,n$ and all $t \ge t_0$, then one has

\[ \mathbf{P}\Big(\sum_{k=1}^n \zeta_k^2 < t^2 n\Big) \le O((Kt)^n) \]

for all $t \ge t_0$.


By setting $t = n^{\gamma'}\delta/2$, it follows crudely from Lemma 7.7 that

\[ \mathbf{P}\Big(\sum_{i=1}^{n-k} |x_i^T w' - u_i|^2 = O(n^{2\gamma'+1}\delta^2)\Big) \le \mathbf{P}\Big(\sum_{i=1}^{n-k} |x_i^T w' - u_i|^2 = O(n t^2)\Big) = \big(O(n^{0.49} q)\big)^{n-k}, \]

where as before we are conditioning on $M_{n-k}$ being fixed.


Undoing the conditioning, we conclude that (38), and hence (37), occurs with probability $O((n^{0.49 + O(\alpha)} q)^n)$, and thus (39) follows. This concludes the proof of Theorem 7.1 in the incompressible case.


Finally, the conditions for $B$ from (23) and (30) can be secured by choosing

\[ B \ge 5A\gamma' + 2\gamma' + 2. \tag{40} \]


8. Consecutive gaps for Erdős-Rényi graphs: proof of Theorem 2.7


By modifying the treatment of Section and using (§, we will show the following. 


Theorem 8.1 (Most eigenvectors poor). Let $A > 0$ and $\sigma > 0$ be fixed. Then, with probability at least $1 - O(\exp(-\alpha_0 n))$ for some positive constant $\alpha_0 > 0$ (which may depend on $A, \sigma$), every unit eigenvector $v$ of $A_n$ with eigenvalue $\lambda$ in the interval $[-10\sqrt{n}, 10\sqrt{n}]$ obeys the concentration estimate

\[ \rho_\delta(v) \le n^{\sigma}\delta \]

for all $\delta \ge n^{-A}$.


In fact, we shall reduce Theorem 8.1 to

Theorem 8.2. Let $\sigma > 0$ be fixed, and let $\delta \ge n^{-A}$ and $q \ge n^{\sigma}\delta$. Let $\alpha > 0$ be a sufficiently small fixed quantity. Then with probability at least $1 - O(\exp(-\alpha_0 n))$ for some positive constant $\alpha_0 > 0$, for every unit eigenvector $v$ of $A_n$ with an eigenvalue $\lambda$ in the interval $[-10\sqrt{n}, 10\sqrt{n}]$, one either has

\[ \rho_{\delta,\alpha}(v) \le q \]

or

\[ \rho_{n^{1/2+\alpha}\delta,\alpha}(v) > n^{1/2-\sigma} q. \tag{41} \]

We remark that, unlike in Section 7, $\alpha$ is a (sufficiently small) constant here. It is not hard to see that Theorem 8.2 implies Theorem 8.1. Indeed, let $A_n, A, \sigma$ be as in Theorem 8.1. We may assume $\delta \le 1$, as the
claim is trivial for <5 > 1. By rounding S up to the nearest power of and using the union bound, it 

suffices to show that for each given such choice of n~^ < S < 1, with probability 1 — 0(exp(—ogn)) all unit 
eigenvectors v of An with eigenvalue in the interval [—10i/n, 10y/n\ satisfy the bound 


ps { v ) < (42) 

Let a > 0 be a sufficiently small fixed quantity, and set J := [2 a 1J . For each j = 0,..., J + 1, we define the 
quantities 


Sj := and Kj := ^ 


Now suppose that (42) fails. We conclude that 
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PSo,a{'^) > PSo{v) > KoSo- 

On the other hand, if a is sufficiently small depending on cr, A, 


PSj,a{v) < i < Sj < Kj5j. 


Thus there exists 0 < j < J such that 


PSj,a{v) > Kjdj] and pSj+i,a{v) < Kj+idj+i. 


Consequently, 


Pni/2+<»A. 


,{v) < 


But by Theorem 8.2 (with $\sigma$ replaced by $\sigma/4$, say, and $\delta$ and $q$ replaced by $\delta_j$ and $K_j\delta_j$) and by the union bound over the $J$ different choices of $j$, this event can only occur with probability $O(\exp(-\alpha_0 n))$ for some positive constant $\alpha_0$, and Theorem 8.1 follows.


In what follows we will establish Theorem 8.2. Again, similarly to the proof of Theorem 7.1, we divide into two cases.


8.1. The compressible case. Assume that $q \ge n^{-1/2+\alpha}$ and $\rho_{\delta,\alpha}(v) \ge q$. By a simple covering argument, we have


P5n ~^/^+^^ 


- 1 / 2+0 


Set $\delta' := \delta n^{-1/2+\alpha}/q$. Proceeding similarly to Subsection 7.1, we just need to show

\[ \mathbf{P}(\mathcal{E}_S) = O(\exp(-\alpha_0 n)), \tag{43} \]

uniformly in S' C [n] with |S| = [anj and some fixed oq > 0 independent of a. Here £s is the event 
Ps'iv i S) > 

By symmetry we may take $S = \{n'+1,\dots,n\}$ for some $n' = (1 - O(\alpha))n$. Now suppose that the event $\mathcal{E}_S$ holds, and let $v, \lambda$ be as above. We split

\[ A_n = \begin{pmatrix} A_{n'} & D \\ D^* & A_{n-n'} \end{pmatrix} \quad\text{and}\quad v = \begin{pmatrix} v' \\ v'' \end{pmatrix}, \]

where $A_{n'}, A_{n-n'}$ are the top left $n' \times n'$ and bottom right $(n-n') \times (n-n')$ minors of $A_n$ respectively, and $v' \in \mathbb{R}^{n'}$ and $v'' \in \mathbb{R}^{n-n'}$, and the entries of $v'$ are bounded by $\delta'$ by applying Theorem 7.2. Again, as in Subsection 7.1,

we have ^ < 


W'W < 1, and the eigenvector equation AnV = Xv implies that 


\\Dv" - aln' II = 0{nd') = Oin^/^+’^S/q) = (44) 

for some scalar $a \in \mathbb{R}$, where $\tilde D = D - p 1_{n'} 1_{n-n'}^T$ is the mean zero normalization of $D$, and we note that by

and by the bound on $\lambda$,

\[ \|A_{n'} - \lambda - p 1_{n'} 1_{n'}^T\| = O(\sqrt{n}). \]


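However, with probability $1 - O(\exp(-\alpha_0 n))$ for some positive constant $\alpha_0$ independent of $\alpha$, we easily have

\[ \inf_{w \in \mathbb{R}^{n-n'}:\ \|w\| = 1}\ \inf_{a \in \mathbb{R}} \|\tilde D w - a 1_{n'}\| \gg \sqrt{n}. \tag{45} \]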


In fact, without the al term, as noted at the end of Subsection |7.1[ the left-hand side becomes the least 
singular value of D, and the result follows from m Theorem 3.1]. The epsilon net arguments used there 
can be easily modified to handle the additional $a 1_{n'}$ term. Alternatively, we can argue as follows. We group the $n'$ rows of $\tilde D$ into pairs, possibly plus a remainder row which we simply discard. For each such pair of rows, we subtract the first row in the pair from the second, and let $D'$ be the $\lfloor n'/2 \rfloor \times (n - n')$ matrix formed by these differences. From the triangle inequality we see that $\|D' w\| \le 2\|\tilde D w - a 1_{n'}\|$ for all $w \in \mathbb{R}^{n-n'}$ and

a G IR. From [H Theorem 3.1] we have 


\[ \inf_{\|w\| = 1} \|D' w\| \gg \sqrt{n} \]

with probability $1 - O(\exp(-\alpha_0 n))$, and the claim (45) follows.
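The row-pairing trick can be checked directly: subtracting one row of a pair from the other cancels any common shift $a$, and the inequality $(x - y)^2 \le 2(x^2 + y^2)$ gives $\|D'w\|^2 \le 2\|Dw - a\mathbf{1}\|^2$, which is slightly stronger than the factor 2 on norms quoted above. The sketch below tests this on random small integer matrices, where all arithmetic is exact. The function names, matrix sizes and seed are illustrative assumptions.

```python
import random

def pairwise_row_differences(D):
    """Subtract the first row of each consecutive pair from the second; drop a leftover row."""
    return [[b - a for a, b in zip(D[2 * j], D[2 * j + 1])]
            for j in range(len(D) // 2)]

def matvec(M, w):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(m * x for m, x in zip(row, w)) for row in M]

def sq_norm(x):
    """Squared Euclidean norm."""
    return sum(c * c for c in x)

def check_pairing_inequality(trials=200, seed=0):
    """Check ||D'w||^2 <= 2 ||Dw - a*1||^2 on random 0/1 matrices and integer w, a."""
    rng = random.Random(seed)
    for _ in range(trials):
        rows, cols = rng.randint(2, 9), rng.randint(1, 4)
        D = [[rng.randint(0, 1) for _ in range(cols)] for _ in range(rows)]
        w = [rng.randint(-5, 5) for _ in range(cols)]
        a = rng.randint(-5, 5)
        lhs = sq_norm(matvec(pairwise_row_differences(D), w))
        rhs = sq_norm([y - a for y in matvec(D, w)])
        if lhs > 2 * rhs:
            return False
    return True
```

Since the differences of rows are unchanged when a constant is subtracted from every entry, the same matrix $D'$ arises from $D$ and from its centered version.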


Comparing (45) with (44) (after rescaling $v''$ to be of unit length) and with (42), we obtain a contradiction with probability $1 - O(\exp(-\alpha_0 n))$, and hence (43) follows.


8.2. The incompressible case. In this case there is a unit eigenvector $v = (v_1,\dots,v_n)^T$ of $A_n$ with eigenvalue $\lambda \in [-10\sqrt{n}, 10\sqrt{n}]$ so that

\[ \rho_{\delta,\alpha}(v) > q \quad\text{and}\quad \rho_{n^{1/2+\alpha}\delta,\alpha}(v) \le n^{1/2-\sigma} q. \tag{46} \]

From the second inequality, there must exist a subset $I$ of $\{1,\dots,n\}$ of cardinality $|I| = \lfloor \alpha n \rfloor$ with

\[ \rho_{n^{1/2+\alpha}\delta}(v|_I) \le n^{1/2-\sigma} q. \tag{47} \]

For each $I$, let $\mathcal{E}'_I$ be the event that the above situation occurs; thus the conclusion of Theorem 8.2 fails with probability at most

\[ \sum_{I \subset \{1,\dots,n\}:\ |I| = \lfloor \alpha n \rfloor} \mathbf{P}(\mathcal{E}'_I) + O(\exp(-\Theta(n))). \]


It suffices to justify the following improvement of Lemma 7.4 for £j. 
Lemma 8.3. We have 

P{£j) < 

where the implied constant in the 0(.) notation is independent of a. 


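By symmetry we may assume that $I = \{1,\dots,k\}$, where $k := \lfloor \alpha n \rfloor$. Again, we split

\[ A_n = \begin{pmatrix} A_k & D \\ D^* & A_{n-k} \end{pmatrix} \]

where $A_k, A_{n-k}$ are the top left $k \times k$ and bottom right $(n-k) \times (n-k)$ minors of $A_n$ respectively. We also split $v = (v', v'')^T$ with $v' \in \mathbb{R}^k$ and $v'' \in \mathbb{R}^{n-k}$.


Now suppose that $A_n$ is such that $\mathcal{E}'_I$ holds. Then by invoking Theorem 7.3 we obtain the following variant of Lemma 7.5 (where $\gamma'$ is replaced by, say, 100).


Lemma 8.4. There exists a subset $\mathcal{N}$ of $\mathbb{R}^n \times \mathbb{R}^k \times \mathbb{R}$ of size $O(n^{-n/2 + O(\alpha n)} q^{-n})$ such that for any eigenvector-eigenvalue pair $(v, \lambda)$ satisfying $\mathcal{E}'_I$, there exists $(\tilde v, w', \tilde\lambda) \in \mathcal{N}$ which approximates $(v, \lambda)$ as follows: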


• \vj — Vj\ < S for 1 < j < n; 

• \vj — Wj\ < for 1 < j < k; 

• |A — A| < 


Proof of Lemma \8.3\ Let us now assume that (u,A) can be approximated by tuple {v,w',X) S A/” as in 
Lemma 8.4 The partial eigenvector equation D*v' + {An-k — X}v" = 0 would then imply that (assuming 

(§) 


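\[ \|D^* w' - u\| = O(n\delta) \tag{48} \]

where $u \in \mathbb{R}^{n-k}$ is the vector $-(A_{n-k} - \tilde\lambda)\tilde v''$.

We condition $A_{n-k}$ to be fixed, so that $u$ is deterministic. Let $x_1,\dots,x_{n-k} \in \mathbb{R}^k$ denote the rows of $D^*$. The bound (48) then can be rewritten as

\[ \sum_{i=1}^{n-k} |x_i^T w' - u_i|^2 = O(n^2\delta^2) \tag{49} \]

where $u_1,\dots,u_{n-k} \in \mathbb{R}$ are the coefficients of $u$.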


Now, by (47), Pni/ 2 +as{v') < combine this with the approximation from Lemma 8.4 we obtain 
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p„i/2+<.5/2(w') < ^q. 

Thus, by a simple covering argument, for every 5" > one has 


P{\xf w'— Vi\ < S”) = o(^n q/^yi = 1,... ,n — k. 


It follows from (50) and from Lemma 7.7 that for every 5" > 


'i—k 


P( ^ |a;f w' - < 5”^{n - fc)) = 

In particular, with 5" := n^/^+“(5/2, and by discarding a factor of n“. 


71—k 

2=1 




( 50 ) 


where we are conditioning on An-k being fixed. Undoing the conditioning, we conclude that (49) and 


hence ([4^ occurs with probability This concludes the proof of 

Theorem |8.2| in the incompressible case by the union bound over all tuples {v, w', A) from M. 


□ 


9. Application: non-degeneration of eigenvectors 

Using the repulsion between eigenvalues, we can show that the entries of eigenvectors are rarely very close 
to zero. 

First of all, Theorem 2.6 easily implies the following weak stability result.

Lemma 9.1. For any $A > 0$, there exists $B > 0$ depending on $A$ and $\gamma$ such that the following holds with probability at least $1 - O(n^{-A})$: if there exist $\lambda \in \mathbb{R}$ and $v \in S^{n-1}$ such that $\|(M_n - \lambda)v\| \le n^{-B}$, then $M_n$ has an eigenvalue $\lambda_{i_0}$ with a unit eigenvector $u_{i_0}$ such that

I Aio - A| < and ||?; - u^g || <C 


Proof of Lemma 9.1 First, by Theorem 2.6 we can assume that 


I A* - Xj\>n ^ j. 


( 51 ) 


Assume that $v = \sum_{1 \le i \le n} c_i u_i$, where $\sum_i c_i^2 = 1$ and $u_1,\dots,u_n$ are the unit eigenvectors of $M_n$. Then

\[ \|(M_n - \lambda)v\| = \Big(\sum_i c_i^2(\lambda_i - \lambda)^2\Big)^{1/2} \le n^{-B}. \tag{52} \]

Let iQ be an index with > \l\fn, then ||(M„ — A)t>|| < n ^ implies that |A — Aij,| < n b+i/ 2 ^ 
(51), |A — Ail > for all i ^ io- Incorporating with (52) we obtain 


|ci| = 0{n ^ io- 


Thus 


||u-UiJ| < n 

□ 


We next apply Theorem 4.3 and Lemma 9.1 to prove Theorem 3.3.


Proof of Theorem \3.3[ It suffices to prove for i = 1. The eigenvalue equation MnV = Xv for with |?;i| < n ^ 
can be separated as 


{Mn-i — X)v' = —vxX] and v'^X = uiTOh, 

where X is the first column vector of M„, and hence independent of M^-i- As B is sufficiently large 
compared to A and 7 , and by adding a small amount of order 0{n~^) to the first component of v' so as to 
11?;'11 = 1 , we can reduce the problem to 

P (dA gR,v' G - A)u'|| < and \v'^X\ < 


To this end, by Lemma 9.1 if ||(M„_i — A)?;'!! < n then Mn-i has a unit eigenvector u' with ||u' —?;'|| <C 
t^-b/8. gQ would imply that 




However, by Theorem 4.3 (53) holds with probability 0(n This concludes our proof. 


(53) 

□


Appendix A. Proof of Theorem 4.2


Lemma A.1. There are positive constants $\gamma_1, \gamma_2, \gamma_3$ (depending on the sub-gaussian moments) with $\gamma_2 < 1$ such that the following holds with probability at least $1 - \exp(-\gamma_1 n)$ with respect to the symmetric Wigner matrix $X_n$ as in Theorem 2.1. For any unit eigenvector $u = (u_1,\dots,u_n)$ of $X_n$ and any subset $S \subset \{1,\dots,n\}$ of size $\gamma_2 n$,

\[ \sum_{i \in S} u_i^2 \le \gamma_3. \]

The same statement holds for $A_n(p)$.


Theorem 4.2 follows easily from this lemma. Indeed, consider a unit eigenvector $u$ of $X_n$. Let $S$ be the set of coordinates with absolute value at least $(1 - \gamma_3)n^{-1/2}$. Then

( I - ^ > 73. 

ies 


By Lemma A.1, any set $S$ with this property should have size at least $\gamma_2 n$, with probability at least $1 - \exp(-\gamma_1 n)$. We obtain Theorem 4.2 by setting $c_1 = \gamma_1$, $c_2 = \gamma_2$ and $c_3 = (1 - \gamma_3)$.


Lemma A.l was proved for A„(p) in Theorem 3.1] in a different, but equivalent form. The proof extends 
with no difficulty to Wigner matrices. We also refer the reader to [21] for related results of this type. 


Appendix B. Proof of Lemma 5.9

The treatment here is based on [36]. For short, we will write $T_D$ instead of $T_{D,\kappa,\gamma,\alpha}$. The key ingredient is finding a fine net for $T_D$.

Lemma B.1. Let $n^{-c} \le \alpha \le c'/4$. For every $D \ge 1$, the level set $T_D$ accepts a $O(\kappa/(\sqrt{\alpha} D))$-net $\mathcal{N}$ of size


|A^I< 


(cpr ^ 2 /. 


where C is an absolute constant. 


Roughly speaking, the principle is similar to the proof of Lemma [7A| and Lemma [ 8 ^ We will break x £ To 
into disjoint subvectors of size roughly m, where m = [cm], and find appropriate nets for each. In what 
follows we will be working with first. 

Definition B.2 (Level sets). Let Dq > c^/m. Define Soq C 5'™“^ to be the level set 


m—1 


Soo ■■= {x G 


: Dq < LCD^ ,y(a:) < 2Dq). 
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Notice that k = o{y/rn) because k = and m> We will invokde the following result from [23| . 

Lemma B.3. [23l Lemma 4.7] There exists a {2k/ Do)-net of Sdq of cardinality at most {CqDq/, 
where Cq is an absolute constant. 

As the proof of this result is short and uses the important notion of LCD, we include it here for the reader's convenience.

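Proof of Lemma B.3. For $x \in S_{D_0}$, denote

$$D(x) := \mathrm{LCD}_{\kappa,\gamma}(x).$$

By definition, $D_0 \le D(x) \le 2D_0$ and there exists $p \in \mathbb{Z}^m$ with

$$\left\| x - \frac{p}{D(x)} \right\| \le \frac{\kappa}{D(x)} = O\!\left(\frac{\kappa}{\sqrt{m}}\right) = o(1).$$

As $\|x\| = 1$, this implies that $\|p\| \approx D(x)$, more precisely

$$1 - \frac{\kappa}{D(x)} \le \left\| \frac{p}{D(x)} \right\| \le 1 + \frac{\kappa}{D(x)}. \tag{54}$$

This implies that

$$\|p\| \le (1 + o(1)) D(x) < 3D_0. \tag{55}$$

It also follows from (54) that

$$\left\| x - \frac{p}{\|p\|} \right\| \le \left\| x - \frac{p}{D(x)} \right\| + \left\| \frac{p}{\|p\|} \left( \frac{\|p\|}{D(x)} - 1 \right) \right\| \le 2\frac{\kappa}{D(x)} \le \frac{2\kappa}{D_0}. \tag{56}$$

Now set

$$\mathcal{N}_0 := \left\{ \frac{p}{\|p\|} : p \in \mathbb{Z}^m \cap B(0, 3D_0) \right\}.$$

By (55) and (56), $\mathcal{N}_0$ is a $(2\kappa/D_0)$-net for $S_{D_0}$. On the other hand, it is known that the size of $\mathcal{N}_0$ is bounded by $(C_0 D_0/\sqrt{m})^m$ for some absolute constant $C_0$.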


□ 
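The rounding step behind the net can be illustrated numerically. The sketch below is an illustration under stated assumptions, not the authors' construction: the helper names `norm` and `round_to_direction`, the test vector, and the scale `D = 50.0` are hypothetical. It rounds $Dx$ to the nearest integer point $p$, sets $\kappa_{\mathrm{eff}} := \|Dx - p\|$, and checks the two inequalities used in the proof: $\|p\| \le D + \kappa_{\mathrm{eff}}$ and $\|x - p/\|p\|\| \le 2\kappa_{\mathrm{eff}}/D$.

```python
import math

def norm(v):
    """Euclidean norm of a list of numbers."""
    return math.sqrt(sum(t * t for t in v))

def round_to_direction(x, D):
    """Round D*x to the nearest integer point p.

    Returns (p, kappa_eff, err) with kappa_eff = ||D*x - p|| and
    err = ||x - p/||p|| ||, the distance from x to the normalized lattice point.
    """
    p = [round(D * t) for t in x]
    kappa_eff = norm([D * t - q for t, q in zip(x, p)])
    p_norm = norm(p)
    if p_norm == 0:
        raise ValueError("D too small: D*x rounds to the origin")
    err = norm([t - q / p_norm for t, q in zip(x, p)])
    return p, kappa_eff, err

# Hypothetical unit vector and scale, for illustration only.
x_raw = [3.0, 4.0, 12.0, 0.2]
x = [t / norm(x_raw) for t in x_raw]
D = 50.0
p, kappa_eff, err = round_to_direction(x, D)
```

Both checks follow from the triangle inequality alone, for any unit vector $x$ and any $D$ for which $p \ne 0$. The role of the LCD in the lemma is to guarantee that a suitable $D(x)$ with small $\kappa$ exists in the window $[D_0, 2D_0]$.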


In fact we can slightly improve the approximations in Lemma B.3 as follows.

Lemma B.4. Let $c\sqrt{m} \le D_0 \le D$. Then the set $S_{D_0}$ has a $(2\kappa/D)$-net of cardinality at most $(C_0 D/\sqrt{m})^m$ for some absolute constant $C_0$ (possibly different from that of Lemma B.3).


Proof of Lemma B.4. First, by Lemma B.3 one can cover $S_{D_0}$ by $(C_0 D_0/\sqrt{m})^m$ balls of radius $2\kappa/D_0$. We then cover each of these balls by smaller balls of radius $2\kappa/D$; the number of small balls needed per ball is at most $(O(D/D_0))^m$. Thus there are at most $(O(D/\sqrt{m}))^m$ balls in total. □
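The count in this proof can be written out explicitly. The display below is a sketch in which $C_1$ is a name introduced here for the constant hidden in $O(D/D_0)$. By the standard volumetric bound, a ball of radius $R$ in $\mathbb{R}^m$ can be covered by at most $(1 + 2R/r)^m \le (3R/r)^m$ balls of radius $r \le R$, so with $R = 2\kappa/D_0$ and $r = 2\kappa/D$ one may take $C_1 = 3$.

```latex
\[
\underbrace{\left(\frac{C_0 D_0}{\sqrt{m}}\right)^{m}}_{\text{balls of radius } 2\kappa/D_0}
\cdot
\underbrace{\left(\frac{C_1 D}{D_0}\right)^{m}}_{\text{small balls per large ball}}
= \left(\frac{C_0 C_1 D}{\sqrt{m}}\right)^{m}.
\]
```

The factors of $D_0$ cancel, which is why the final bound depends only on the target scale $D$ and not on the level $D_0$.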


Taking the union of these nets as $D_0$ ranges over powers of two, we thus obtain the following.

Lemma B.5. Let $D \ge c\sqrt{m}$. Then the set $\{x \in S^{m-1} : c\sqrt{m} \le \mathrm{LCD}_{\kappa,\gamma}(x) \le D\}$ has a $(2\kappa/D)$-net of cardinality at most

$$(C_0 D/\sqrt{m})^m \log_2 D,$$

for some absolute constant $C_0$ (possibly different from that of Lemma B.3 and Lemma B.4).
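The logarithmic factor comes from the number of dyadic levels needed to cover the range of the LCD. The following is a minimal sketch of that bookkeeping; the function name `dyadic_levels` and the sample endpoints are hypothetical, with `lo` standing in for $c\sqrt{m}$ and `hi` for $D$.

```python
import math

def dyadic_levels(lo, hi):
    """Scales D0 = lo * 2**j whose windows [D0, 2*D0] together cover [lo, hi]."""
    if not (0 < lo <= hi):
        raise ValueError("need 0 < lo <= hi")
    levels = []
    d0 = lo
    while True:
        levels.append(d0)
        if 2 * d0 >= hi:  # the current window already reaches hi
            break
        d0 *= 2
    return levels

# Hypothetical endpoints, for illustration only.
levels = dyadic_levels(lo=8.0, hi=1000.0)
```

Each level contributes a net of size at most $(C_0 D/\sqrt{m})^m$ by Lemma B.4, and the number of levels is at most about $\log_2 D$, which gives the stated cardinality.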


We can also update the net above for x without normalization. 

Lemma B.6. Let $D \ge c\sqrt{m}$. Then the set $\{x \in \mathbb{R}^m, \|x\| \le 1, c\sqrt{m} \le \mathrm{LCD}_{\kappa,\gamma}(x/\|x\|) \le D\}$ has a $(2\kappa/D)$-net of cardinality at most $(C_0 D/\sqrt{m})^m D^2$ for some absolute constant $C_0$.


Proof of Lemma B.6. Starting from the net obtained from Lemma B.5, we just need to $2\kappa/D$-approximate the fiber of $\mathrm{span}(x/\|x\|)$ in $B^m$. □


We now justify our main lemma. 


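Proof of Lemma B.1. We first write $x = x_{I_0} \cup \mathrm{spread}(x)$, where $\mathrm{spread}(x) = I_1 \cup \dots \cup I_{k_0} \cup J$ such that $|I_k| = \lceil \alpha n \rceil$ and $|J| \le \alpha n$. Notice that we trivially have


$$|\mathrm{spread}(x)| \ge |I_1 \cup \dots \cup I_{k_0}| = k_0 \lceil \alpha n \rceil \ge |\mathrm{spread}(x)| - \alpha n \ge c'n/2.$$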


Thus we have

$$\frac{c'}{2\alpha} \le k_0 \le \frac{2c'}{\alpha}.$$

In the next step, we will construct nets for each $x_{I_j}$. For $x_{I_0}$, we construct trivially a $(1/D)$-net $\mathcal{M}_0$ of size

$$|\mathcal{M}_0| \le (3D)^{|I_0|}.$$


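For each $I_k$, as

$$\mathrm{LCD}_{\kappa,\gamma}(x_{I_k}/\|x_{I_k}\|) \le \widehat{\mathrm{LCD}}_{\kappa,\gamma}(x,\alpha) \le D,$$

by Lemma B.6 (where the condition $\mathrm{LCD}_{\kappa,\gamma}(x_{I_k}/\|x_{I_k}\|) \gg \sqrt{|I_k|}$ follows from, say, Theorem 7.2 because the entries of $x_{I_k}/\|x_{I_k}\|$ are all of order $1/\sqrt{\alpha n}$ while $\kappa = o(\sqrt{\alpha n})$), one obtains a $(2\kappa/D)$-net $\mathcal{M}_k$ of size

$$|\mathcal{M}_k| \le \left(\frac{C_0 D}{\sqrt{|I_k|}}\right)^{|I_k|} D^2.$$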


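Combining the nets together, as $x = (x_{I_0}, x_{I_1}, \dots, x_{I_{k_0}}, x_J)$ can be approximated by $y = (y_{I_0}, y_{I_1}, \dots, y_{I_{k_0}}, y_J)$ with $\|x_{I_j} - y_{I_j}\| \le \frac{2\kappa}{D}$, we have

$$\|x - y\| \le \sqrt{k_0 + 1}\,\frac{2\kappa}{D} = O\!\left(\frac{\kappa}{\sqrt{\alpha} D}\right).$$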




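As such, we have obtained a $\beta$-net $\mathcal{N}$, where $\beta = O\!\left(\frac{\kappa}{\sqrt{\alpha} D}\right)$, of size

$$|\mathcal{N}| \le 2^n |\mathcal{M}_0| |\mathcal{M}_1| \cdots |\mathcal{M}_{k_0}| \le 2^n (3D)^{|I_0|} \prod_{k=1}^{k_0} \left(\frac{C_0 D}{\sqrt{|I_k|}}\right)^{|I_k|} D^2.$$

This can be simplified to

$$|\mathcal{N}| \le \frac{(CD)^n}{(\sqrt{\alpha n})^{c'n/2}} D^{O(1/\alpha)}.$$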


□ 
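The final simplification can be spelled out as follows. This is a sketch under assumptions that are natural in this setting but are stated here rather than taken from the text: $C_0 \ge 1$, $D \ge 1$, $\alpha n \ge 1$, $|I_0| + \sum_k |I_k| \le n$, $\sum_k |I_k| = k_0 \lceil \alpha n \rceil \ge c'n/2$, and $k_0 \le 2c'/\alpha$. The constant $C = 6C_0$ is one admissible choice.

```latex
\begin{align*}
2^n (3D)^{|I_0|} \prod_{k=1}^{k_0} \left(\frac{C_0 D}{\sqrt{|I_k|}}\right)^{|I_k|} D^2
&\le (6 C_0)^n \, D^{|I_0| + \sum_k |I_k|} \, (\alpha n)^{-\frac{1}{2} \sum_k |I_k|} \, D^{2 k_0} \\
&\le \frac{(C D)^n}{(\sqrt{\alpha n})^{c'n/2}} \, D^{4c'/\alpha},
\qquad C := 6 C_0 .
\end{align*}
```

The first line uses $|I_k| = \lceil \alpha n \rceil \ge \alpha n$ and collects the constants; the second uses $D \ge 1$ for the power $D^n$, the lower bound on $\sum_k |I_k|$ for the power of $\alpha n$, and the upper bound on $k_0$ for the factor $D^{O(1/\alpha)}$.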


Before completing the proof of Lemma 5.9 we cite another important consequence of Lemma 5.8 (on the small ball estimate) and Lemma 7.7 (on the tensorization trick).


Lemma B.7. Let x G Incomp(co, ci) and a G (0,c'). Then for any /3 > -^y/a{'LC'DK.,~t{x,a)) ^ one has 


Pp^iiX^ - Xo)x) < . 

Proof of Lemma 5.9. It suffices to show the result for the level set $\{x \in T_D \setminus T_{D/2}\}$. With $\beta = \frac{\kappa}{\sqrt{\alpha} D}$, the condition on $\beta$ of Lemma B.7 is clearly guaranteed:

$$\frac{\kappa}{\sqrt{\alpha} D} \gg \frac{\sqrt{\alpha}}{D}.$$
















This lemma then implies 


$P\left(\exists x \in T_D \setminus T_{D/2} : \|(X_n - \lambda_0)x - u\| = o(\beta\sqrt{n})\right) \le$

\ aD 

< 

provided that k = v?'^ with sufficiently small c compared to c', and that D < 


{CDY 




jjOil/a) 


{CDY 


{y/anY''^^'^ 


]jO{l/c 


□